Python Config.fix_heller_dates Examples

Programming Language: Python

Namespace/Package Name: utils

Class/Type: Config

Method/Function: fix_heller_dates

Examples at hotexamples.com: 1

Python Config.fix_heller_dates - 1 examples found. These are the top rated real world Python examples of utils.Config.fix_heller_dates extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

Config(30)

config_path(9)

create_api_client(5)

defaults_guild(5)

all_guilds(3)

getInstance(3)

att(3)

embedding_name(3)

config_via_prompt(2)

batch_size(2)

epochs(2)

__init__(2)

EnvConfig(2)

Nout(2)

features_config(1)

drop_remainder(1)

device(1)

saveConfig(1)

df_dim(1)

dfc_dim(1)

loadConfig(1)

distances(1)

domains(1)

edges(1)

dump(1)

epochs_train2(1)

embedding_dict_to_use(1)

folds_number(1)

epoch(1)

flip_inputs(1)

fix_heller_dates(1)

epochs_interval(1)

epochs_interval_evaluation(1)

epochs_number(1)

defaults_member(1)

core_thresh(1)

data(1)

all(1)

Dataset(1)

EnvConfigApp(1)

OUTPUTS_DIR(1)

Session(1)

_cfg(1)

_get(1)

_recursive_merge(1)

_save(1)

_simplify_defaults(1)

add(1)

all_members(1)

cuda(1)

Example #1

Show file

        # Put into a dataframe:
        df = pd.DataFrame(data=contents[1:])
        df.columns = contents[0]

        # Reindex using `docid`;
        # This lets us gauge posting dates even if that's missing for a doc:
        dfindex = []
        for doc in df.docid:
            dfindex.append(int(re.sub('[a-z]*$', '', doc)))

        df.index = dfindex
        df.sort_index(inplace=True)

        if file == '43-heller.csv':
            df = Config.fix_heller_dates(df)

        # Subset to 2013-2016:
        df['form_date'] = pd.to_datetime(df['form_date'], yearfirst=True)
        validdates = df['form_date'] >= pd.to_datetime('2013-01-03',
                                                       yearfirst=True)
        lastvalid = max(validdates.index.where(validdates == True))

        df = df.loc[:lastvalid, :]

        # Drop empty docs:
        notempty = df['clean_text'] != ''
        df = df.loc[notempty, :]

        n_docs = len(df)