Python GenotypePhenotypeMap.read_dataframe примеры использования

Язык программирования: Python

Пространство имен/Пакет: gpmap

Класс/Тип: GenotypePhenotypeMap

Метод/Функция: read_dataframe

Примеров на hotexamples.com: 4

Python GenotypePhenotypeMap.read_dataframe - 4 примера найдено. Это лучшие примеры Python кода для gpmap.GenotypePhenotypeMap.read_dataframe, полученные из open source проектов. Вы можете ставить оценку каждому примеру, чтобы помочь нам улучшить качество примеров.

Основные методы

Показать Скрыть

GenotypePhenotypeMap(30)

get_neighbors(4)

read_dataframe(4)

_data(4)

read_json(3)

wildtype(2)

_mutant(2)

_add_n_mutations(1)

to_json(1)

to_excel(1)

to_dict(1)

to_csv(1)

site_labels(1)

sample_genotypes(1)

_add_mutant(1)

read_csv(1)

_mutations(1)

mutations(1)

_encoding_table(1)

_add_binary(1)

add_missing_genotypes(1)

_wildtype(1)

_site_labels(1)

_rebuild_map(1)

genotype_is_in(1)

Пример #1

Показать файл

def read_file_to_gpmap(
    input_file_name,
    wildtype=None,
):
    """Read the input file for GPSeer.

    This should be a CSV file with the following columns:
    genotypes, phenotypes, n_replicates, stdeviations
    """
    df = pd.read_csv(input_file_name)
    required_columns = ["genotypes", "phenotypes"]
    optional_columns = ["stdeviations", "n_replicates"]
    for c in required_columns:
        try:
            df[c]
        except AttributeError:
            err = "input file ({}) must contain a column labeled '{}'".format(
                input_file_name, c)
            return AttributeError(err)

    # If wildtype is not given, use the first genotype in the input file.
    if not wildtype:
        wildtype = df.loc[0, 'genotypes']

    # Fill in missing columns for the GenotypePhenotypeMap
    for col in optional_columns:
        if col not in df.columns:
            df[col] = None

    gpm = GenotypePhenotypeMap.read_dataframe(df, wildtype)
    return gpm

Пример #2

Показать файл

    def fit_transform(self, X=None, y=None, **kwargs):
        self.fit(X=X, y=y, **kwargs)
        ypred = self.predict(X=X)

        # Transform map.
        gpm = GenotypePhenotypeMap.read_dataframe(
            dataframe=self.gpm.data[ypred==1],
            wildtype=self.gpm.wildtype,
            mutations=self.gpm.mutations
        )
        return gpm

Пример #3

Показать файл

    def fit_transform(self, X=None, y=None, **kwargs):
        self.fit(X=X, y=y, **kwargs)

        linear_phenotypes = self.transform(X=X, y=y)

        # Transform map.
        gpm = GenotypePhenotypeMap.read_dataframe(
            dataframe=self.gpm.data,
            wildtype=self.gpm.wildtype,
            mutations=self.gpm.mutations
        )

        gpm.data['phenotypes'] = linear_phenotypes
        return gpm

Пример #4

Показать файл

def split_gpm(gpm, idx=None, nobs=None, fraction=None):
    """Split GenotypePhenotypeMap into two sets, a training and a test set.

    Parameters
    ----------
    data : pandas.DataFrame
        full dataset to split.

    idx : list
        List of indices to include in training set

    nobs : int
        number of observations in training.

    fraction : float
        fraction in training set.

    Returns
    -------
    train_gpm : GenotypePhenotypeMap
        training set.

    test_gpm : GenotypePhenotypeMap
        test set.
    """
    train, test = split_data(gpm.data, idx=idx, nobs=nobs, fraction=fraction)

    train_gpm = GenotypePhenotypeMap.read_dataframe(train,
                                                    wildtype=gpm.wildtype,
                                                    mutations=gpm.mutations)

    test_gpm = GenotypePhenotypeMap.read_dataframe(test,
                                                   wildtype=gpm.wildtype,
                                                   mutations=gpm.mutations)

    return train_gpm, test_gpm