Python is_categorical 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: utility

메소드/함수: is_categorical

hotexamples.com에서의 예제들: 3

Python is_categorical - 3개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 utility.is_categorical에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: feature.py 프로젝트: dolaameng/practical_munging_tools

	def fit(self, df, fnames, by):
		self.cprobs, self.fnames = [], []
		for f in fnames:
			if not utility.is_categorical(df, f):
				raise ValueError(f+" must be categorical, use encoding or discretizing")
			cprob = pd.crosstab(df[by], df[f])
			cprob = cprob / cprob.sum(axis = 0)
			cprob = cprob.iloc[1, :]
			cprob.name = "%sIs%s_on_%s" % (by, cprob.name, f)
			self.cprobs.append(cprob)
			self.fnames.append(f)
		return self

예제 #2

파일 보기

파일: feature.py 프로젝트: dolaameng/practical_munging_tools

def _extract_cprobs_by_biclass(df, fnames, by, copy = True):
	"""
	Depreciated - use BiClassProbabilityFeatureExtractor to better handle train and validate data 
	Extract conditional probability features based on binary class labels, 
	see Ref 1 chapter 6 for details. 
	df: DataFrame
	fnames: features to be extracted from - the features must be categorical or discretized numerical
	(call transformation.discretize_numerical for that)
	by: binary class labels (for multiple labels, use one-hot-encoding to get the cprobs-features separately)
	copy: whether copy dataframe or modify in place 
	"""
	result = df.copy() if copy else df 
	for f in fnames:
		if not utility.is_categorical(df, f):
			raise ValueError(f+" must be categorical, use encoding or discretizing")
		cprobs = pd.crosstab(df[by], df[f])
		cprobs = cprobs / cprobs.sum(axis = 0)
		cprobs = cprobs.iloc[1, :]
		cprobs.name = "%sIs%s_on_%s" % (by, cprobs.name, f)
		result = result.join(cprobs, on = f)
	return result

예제 #3

파일 보기

파일: inspection.py 프로젝트: dolaameng/practical_munging_tools

def find_categorical_features(df):
	return np.asarray([f for f in df.columns if is_categorical(df, f)])