def train_and_test(df, preds, seed):
    '''Run a single trial:
        Shuffle df and split it into training and testing subsets
        Train a new model based on the training set
        Test the model with the testing set
        Add prediction data into the preds array

    :param df: dataframe with full set of all available samples
        columns: id, cat1 (primary class), cat2 (secondary),
        title, titlen (cleaned title)
    :param preds: an array of predictions; each prediction is a dictionary
        cat: true category, pred: predicted category,
        conf: model confidence in its prediction (< 1.0),
        title: actual title of the chapter/sample
    :param seed: random seed passed to the classifier for reproducibility
    :return: tuple (classifier_key, testing accuracy, training dataframe)
    '''
    # PREPS
    # randomly split the dataset into train / test / validation subsets
    df = utils.split_dataset(
        df,
        settings.CAT_DEPTH,
        settings.TRAIN_PER_CLASS_MIN,
        settings.TEST_PER_CLASS,
        settings.VALID_PER_CLASS,
    )

    # TRAIN
    classifier = Classifier.from_name(settings.CLASSIFIER, seed)
    # NOTE(review): titles_out_path is a module-level name defined elsewhere
    classifier.set_datasets(df, titles_out_path)
    classifier.train()

    df_test = classifier.df_test

    # optionally report accuracy on the training set as well
    if settings.EVALUATE_TRAINING_SET:
        evaluate_model(
            classifier, classifier.df_train, display_prefix='TRAIN = ')
    accuracy = evaluate_model(
        classifier, df_test, preds, display_prefix='TEST = ')

    classifier_key = utils.get_exp_key(classifier)

    classifier.release_resources()

    return classifier_key, accuracy, classifier.df_train
def prepare_dataset():
    '''Convert input .txt or .csv files into a single .csv file with all the
    necessary columns for training and testing classification models.

    :return: dataframe with columns id, cat1, cat2, title, can_train,
        can_test plus titlen (normalised title)
    '''
    # # experimental work done on first, small dataset.
    # utils.extract_transcripts_from_pdfs()
    # utils.learn_embeddings_from_transcipts()

    # load each titles file into a dataframe
    frames = []
    for fileinfo in settings.DATASET_FILES:
        # skip files usable neither for training nor for testing
        if not (fileinfo['can_train'] or fileinfo['can_test']):
            continue
        titles_path = utils.get_data_path('in', fileinfo['filename'])
        if not os.path.exists(titles_path):
            utils.log_error(
                'The training file ({0}) is missing. See README.md for more info.'
                .format(titles_path))
        df = utils.read_df_from_titles(
            titles_path, use_full_text=settings.FULL_TEXT)
        for flag in ['can_train', 'can_test']:
            df[flag] = fileinfo[flag]
        frames.append(df)

    # DataFrame.append() was removed in pandas 2.0 (and was O(n^2) when
    # called in a loop); concatenate all frames in one pass instead.
    df_all = pd.concat(frames, ignore_index=True) if frames else pd.DataFrame()

    # save that as a csv
    df_all.to_csv(
        titles_out_path,
        columns=['id', 'cat1', 'cat2', 'title', 'can_train', 'can_test'],
        index=False)

    # normalise the title
    classifier = Classifier.from_name(settings.CLASSIFIER, None)
    df_all['titlen'] = df_all['title'].apply(lambda v: classifier.tokenise(v))
    classifier.release_resources()

    return df_all