Python census_ln Examples

Programming language: Python

Namespace/package name: ethnicolr

Method/function: census_ln

Examples at hotexamples.com: 5

Python census_ln - 5 examples found. These are the top-rated real-world Python examples of ethnicolr.census_ln, extracted from open source projects.

Example #1

File: voters.py Project: BloodLustAlpaca/Code

import pandas as pd
import ethnicolr  # https://pypi.org/project/ethnicolr/#description


def addEthnicityFields(df, namefield):
    # use only the last word of the field for analysis
    df['ethname'] = df[namefield].transform(lambda t: t.split()[-1])
    # look up the surname with the library function
    df = ethnicolr.census_ln(df, 'ethname')
    # drop the temporary column
    df = df.drop(columns=['ethname'])

    # the percentage columns can hold non-numeric entries (Example #4
    # replaces '(S)' values), so coerce them to floats; failures become NaN
    newfields = [
        'pctwhite', 'pctblack', 'pctapi', 'pctaian', 'pct2prace', 'pcthispanic'
    ]
    for fieldname in newfields:
        df[fieldname] = pd.to_numeric(df[fieldname].astype(str),
                                      errors='coerce').astype(float)

    return df
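
The conversion loop above relies on `pd.to_numeric(..., errors='coerce')` to turn entries that are not numbers into NaN. As a rough, pandas-free illustration of that coercion idea, the sketch below uses a hypothetical helper, `to_percent`, which is not part of ethnicolr or pandas, on hand-written sample strings. The `'(S)'` entry is the placeholder value that Example #4 replaces before casting to float.

```python
import math


def to_percent(value):
    """Return value as a float, or NaN when it cannot be parsed.

    Hypothetical helper for illustration only; it mimics the effect of
    pd.to_numeric(..., errors='coerce') on a single value.
    """
    try:
        return float(str(value).strip())
    except ValueError:
        return math.nan


# Hand-written sample values, not real census output.
samples = ['70.9', '(S)', ' 0.5 ', None]
converted = [to_percent(v) for v in samples]
print(converted)
```

Entries that parse cleanly come back as floats, while `'(S)'` and missing values end up as NaN, which is the same outcome the pandas loop produces column by column.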

Example #2

File: import_data.py Project: Sun-Kev/MACS30200proj

from ethnicolr import census_ln


def run_census_last(subset_df, census_year):
    """
    Run ethnicolr's census_ln on a dataframe of teacher information.

    Rows with a missing last name are dropped, then census_ln appends the
    Census surname columns (pctwhite, pctblack, ...) for the requested
    census year.

    Input:
        - subset_df: a dataframe that is a subset of teacher information,
          with a 'teacher_last' column
        - census_year: Census year passed through to census_ln (e.g. 2010)
    Output:
        - df: the rows that have a last name, with the Census percentage
          columns appended
    """
    has_last_name_df = subset_df[subset_df.teacher_last.notnull()].copy()
    df = census_ln(has_last_name_df, 'teacher_last', census_year)

    # # keep the relevant columns
    # cols_to_keep = ['pctwhite']
    # df = df[cols_to_keep]

    # # fill NaNs w/ 50%
    # df.fillna(value=float(50), axis=1, inplace=True)

    return df

Example #3

File: ethnicolr-example.py Project: thezakpak/python-examples

#!/usr/bin/python
# -*- coding: utf-8 -*-

import pandas as pd

from ethnicolr import census_ln, pred_census_ln

names = [{'name': 'smith'}, {'name': 'zhang'}, {'name': 'jackson'}]

df = pd.DataFrame(names)

print(df)

print(census_ln(df, 'name'))

print(census_ln(df, 'name', 2010))

print(pred_census_ln(df, 'name'))

Example #4

File: ethnicity.py Project: maxcrous/Elections_Twitter_Project

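import json
import os

import pandas as pd
from ethnicolr import census_ln

# running totals per group; white, black, asian and hispanic are
# incremented below, so they need initial values as well
white = 0
black = 0
asian = 0
hispanic = 0
native_american = 0
two_race = 0
df = []  # rows collected for the current batch (turned into a DataFrame below)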

if not os.path.exists('ethnicity.pkl'):

    with open('full.json', 'r') as tweets_file:

        for idx, line in enumerate(tweets_file):

            try:

                if idx % 10000 == 0 and idx != 0:
                    print(idx)
                    df = pd.DataFrame(df)
                    classed = census_ln(df, 'name')
                    classed = classed.dropna()
                    classed = classed.drop(['name'], axis=1)
                    classed = classed.replace('(S)', 0)
                    classed = classed.astype('float64')
                    classed = classed.divide(100)
                    white += float(classed['pctwhite'].sum())
                    black += float(classed['pctblack'].sum())
                    asian += float(classed['pctapi'].sum())
                    native_american += float(classed['pctaian'].sum())
                    two_race += float(classed['pct2prace'].sum())
                    hispanic += float(classed['pcthispanic'].sum())
                    df = []

                tweet = json.loads(line)
                name = tweet['user']['name']

Example #5

File: raw_impute_DIME_database.py Project: joshzyj/dime_race

from ethnicolr import census_ln


def run_census_ln(subset_df, census_year):
    """Run census_ln on the rows that have a contributor last name."""
    has_last_name_df = subset_df[subset_df.contributor_lname.notnull()].copy()
    return census_ln(has_last_name_df, 'contributor_lname', census_year)