Python get_clean_data 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: pre_process

메소드/함수: get_clean_data

hotexamples.com에서의 예제들: 5

Python get_clean_data - 5개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 pre_process.get_clean_data에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

0

파일 보기

파일: main.py 프로젝트: avratech7/CORE

def system_first_uploading():
    '''
    this func is being called only once when system first uploads
    this func uploads documents with label to DB
    :return:
    '''
    try:
        clean = pre_process.get_clean_data("https://en.wikipedia.org/wiki/Sport", "sport")
        text_arr = []

        for i in clean.clean_data:
            text_arr.append([i, clean.label[0]])

        for i in text_arr:
            query_doc.save_docs_into(i[0], i[1])

        clean2 = pre_process.get_clean_data("https://en.wikipedia.org/wiki/Medicine", "medicine")

        text_arr = []

        for i in clean2.clean_data:
            text_arr.append([i, clean2.label[0]])

        for i in text_arr:
            query_doc.save_docs_into(i[0], i[1])

        query_doc.con.conn.commit()


    except Exception as e:

        print(e)

예제 #2

0

파일 보기

파일: main.py 프로젝트: avratech7/CORE

def system_first_uploading():
    try:
        clean = pre_process.get_clean_data(
            "https://en.wikipedia.org/wiki/Sport", "sport")
        text_arr = []

        for i in clean.clean_data:
            text_arr.append([i, clean.label[0]])

        for i in text_arr:
            query_doc.save_docs_into(i[0], i[1])

        clean2 = pre_process.get_clean_data(
            "https://en.wikipedia.org/wiki/Medicine", "medicine")

        text_arr = []

        for i in clean2.clean_data:
            text_arr.append([i, clean2.label[0]])

        for i in text_arr:
            query_doc.save_docs_into(i[0], i[1])

    except Exception as e:

        print(e)

예제 #3

0

파일 보기

파일: main.py 프로젝트: avratech7/CORE

def finds_users_input_subject():
    users_new_url = input("please enter url:")

    clean_user_text = pre_process.get_clean_data(f"{users_new_url}", "")
    print(
        find_tf_idf.finding_label_of_new_file(
            [clean_user_text.clean_data[4], ""], query_doc.get_docs()))

예제 #4

0

파일 보기

파일: main.py 프로젝트: avratech7/CORE

def finds_users_input_subject():
    """
this func gets user's URL and hopfully returns if the subject is sport medicine or unrecognised
    :return:
    """
    while (True):
        try:
            users_new_url = input("please enter url:")

            clean_user_text = pre_process.get_clean_data(f"{users_new_url}", "")
            print(find_tf_idf.finding_label_of_new_file(
                [clean_user_text.clean_data[0] + clean_user_text.clean_data[1] + clean_user_text.clean_data[2], ""],
                query_doc.get_docs()))
            return
        except Exception as e:
            print(e)

예제 #5

0

파일 보기

파일: main.py 프로젝트: avratech7/CORE

def uploading_more_ducs_to_system():
    """
    this func uploads the system with more docs beyond the docs  which already exists
    """
    new_url = input("enter a new URL to update the docs in system")
    label = input("enter the URL subject")
    try:
        clean = pre_process.get_clean_data(new_url, label)
        text_arr = []

        for i in clean.clean_data:
            text_arr.append([i, clean.label[0]])

        for i in text_arr:
            query_doc.save_docs_into(i[0], i[1])
    except Exception as e:

        print(e)