Python GenotypePhenotypeMap.read_dataframe 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: gpmap

클래스/타입: GenotypePhenotypeMap

메소드/함수: read_dataframe

hotexamples.com에서의 예제들: 4

Python GenotypePhenotypeMap.read_dataframe - 4개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 gpmap.GenotypePhenotypeMap.read_dataframe에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

GenotypePhenotypeMap(30)

get_neighbors(4)

read_dataframe(4)

_data(4)

read_json(3)

wildtype(2)

_mutant(2)

_add_n_mutations(1)

to_json(1)

to_excel(1)

to_dict(1)

to_csv(1)

site_labels(1)

sample_genotypes(1)

_add_mutant(1)

read_csv(1)

_mutations(1)

mutations(1)

_encoding_table(1)

_add_binary(1)

add_missing_genotypes(1)

_wildtype(1)

_site_labels(1)

_rebuild_map(1)

genotype_is_in(1)

예제 #1

파일 보기

def read_file_to_gpmap(
    input_file_name,
    wildtype=None,
):
    """Read the input file for GPSeer.

    This should be a CSV file with the following columns:
    genotypes, phenotypes, n_replicates, stdeviations
    """
    df = pd.read_csv(input_file_name)
    required_columns = ["genotypes", "phenotypes"]
    optional_columns = ["stdeviations", "n_replicates"]
    for c in required_columns:
        try:
            df[c]
        except AttributeError:
            err = "input file ({}) must contain a column labeled '{}'".format(
                input_file_name, c)
            return AttributeError(err)

    # If wildtype is not given, use the first genotype in the input file.
    if not wildtype:
        wildtype = df.loc[0, 'genotypes']

    # Fill in missing columns for the GenotypePhenotypeMap
    for col in optional_columns:
        if col not in df.columns:
            df[col] = None

    gpm = GenotypePhenotypeMap.read_dataframe(df, wildtype)
    return gpm

예제 #2

파일 보기

    def fit_transform(self, X=None, y=None, **kwargs):
        self.fit(X=X, y=y, **kwargs)
        ypred = self.predict(X=X)

        # Transform map.
        gpm = GenotypePhenotypeMap.read_dataframe(
            dataframe=self.gpm.data[ypred==1],
            wildtype=self.gpm.wildtype,
            mutations=self.gpm.mutations
        )
        return gpm

예제 #3

파일 보기

    def fit_transform(self, X=None, y=None, **kwargs):
        self.fit(X=X, y=y, **kwargs)

        linear_phenotypes = self.transform(X=X, y=y)

        # Transform map.
        gpm = GenotypePhenotypeMap.read_dataframe(
            dataframe=self.gpm.data,
            wildtype=self.gpm.wildtype,
            mutations=self.gpm.mutations
        )

        gpm.data['phenotypes'] = linear_phenotypes
        return gpm

예제 #4

파일 보기

def split_gpm(gpm, idx=None, nobs=None, fraction=None):
    """Split GenotypePhenotypeMap into two sets, a training and a test set.

    Parameters
    ----------
    data : pandas.DataFrame
        full dataset to split.

    idx : list
        List of indices to include in training set

    nobs : int
        number of observations in training.

    fraction : float
        fraction in training set.

    Returns
    -------
    train_gpm : GenotypePhenotypeMap
        training set.

    test_gpm : GenotypePhenotypeMap
        test set.
    """
    train, test = split_data(gpm.data, idx=idx, nobs=nobs, fraction=fraction)

    train_gpm = GenotypePhenotypeMap.read_dataframe(train,
                                                    wildtype=gpm.wildtype,
                                                    mutations=gpm.mutations)

    test_gpm = GenotypePhenotypeMap.read_dataframe(test,
                                                   wildtype=gpm.wildtype,
                                                   mutations=gpm.mutations)

    return train_gpm, test_gpm