Python LogOddsRatioUninformativeDirichletPrior Examples

Programming language: Python

Namespace/package name: scattertext.termsignificance

Class/type: LogOddsRatioUninformativeDirichletPrior

Examples on hotexamples.com: 7

Python LogOddsRatioUninformativeDirichletPrior - 7 examples found. These are the top-rated real-world Python examples of scattertext.termsignificance.LogOddsRatioUninformativeDirichletPrior, extracted from open source projects.

Frequently used methods

LogOddsRatioUninformativeDirichletPrior(4)

__init__(2)

Example #1

    def __init__(self, priors, sigma=10, scale_type='none', prior_power=1):
        '''
        Parameters
        ----------
        priors : pd.Series
            term -> prior count

        sigma : float
            prior scale

        scale_type : str
            'none': Don't scale prior. Jurafsky approach.
            'class-size': Scale prior such that the sum of the priors is the same as the
              word count in the document-class being scaled
            'corpus-size': Scale prior to the size of the corpus
            'word': Original formulation from MCQ. Sum of priors will be sigma.
            'background-corpus-size': Scale corpus size to multiple of background-corpus.

        prior_power : numeric
            Exponent to apply to prior
            > 1 will shrink frequent words

        '''
        assert scale_type in [
            'none', 'class-size', 'corpus-size', 'background-corpus-size',
            'word'
        ]
        self._priors = priors
        self._scale_type = scale_type
        self._prior_power = prior_power
        self._scale = sigma
        LogOddsRatioUninformativeDirichletPrior.__init__(self, sigma)

Example #2

File: LogOddsRatioInformativeDirichletPiror.py Project: JasonKessler/scattertext

	def __init__(self,
	             priors,
	             sigma=10,
	             scale_type='none',
	             prior_power=1):
		'''
		Parameters
		----------
		priors : pd.Series
			term -> prior count

		sigma : float
			prior scale

		scale_type : str
			'none': Don't scale prior. Jurafsky approach.
			'class-size': Scale prior such that the sum of the priors is the same as the word count
			  in the document-class being scaled
			'corpus-size': Scale prior to the size of the corpus
			'word': Original formulation from MCQ. Sum of priors will be sigma.
			'background-corpus-size': Scale corpus size to multiple of background-corpus.

		prior_power : numeric
			Exponent to apply to prior
			> 1 will shrink frequent words

		'''
		assert scale_type in ['none', 'class-size', 'corpus-size',
		                      'background-corpus-size', 'word']
		self._priors = priors
		self._scale_type = scale_type
		self._prior_power = prior_power
		self._scale = sigma
		LogOddsRatioUninformativeDirichletPrior.__init__(self, sigma)
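
The scale_type options in the docstring above can be illustrated with a small stand-alone sketch. This is only one reading of the docstring, not the library's code: scale_priors, class_word_count and corpus_word_count are hypothetical names, a plain dict stands in for the pd.Series, and prior_power and 'background-corpus-size' are left out because the docstring does not pin down their arithmetic.

```python
def scale_priors(priors, class_word_count, corpus_word_count,
                 sigma=10, scale_type='none'):
    """Hypothetical helper: rescale a term -> prior-count mapping.

    Follows the docstring's wording only; how the real class combines
    sigma with the other scale types is not shown in the excerpt.
    """
    total = sum(priors.values())
    if scale_type == 'none':
        factor = 1.0                          # leave the prior counts untouched
    elif scale_type == 'class-size':
        factor = class_word_count / total     # priors sum to the class word count
    elif scale_type == 'corpus-size':
        factor = corpus_word_count / total    # priors sum to the corpus word count
    elif scale_type == 'word':
        factor = sigma / total                # priors sum to sigma
    else:
        raise ValueError('unsupported scale_type: %r' % scale_type)
    return {term: count * factor for term, count in priors.items()}


priors = {'the': 6, 'cat': 3, 'sat': 1}
scaled = scale_priors(priors, class_word_count=100, corpus_word_count=500,
                      scale_type='class-size')
print(scaled)
```

With scale_type='class-size' the relative weight of each term is preserved while the total prior mass matches the class being scored, so a large background word list does not swamp a small document class.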

Example #3

File: LogOddsRatioInformativeDirichletPiror.py Project: rjhere/scattertext

    def __init__(self, priors, alpha_w=10):
        '''
        Parameters
        ----------
        alpha_w : float
            The constant prior.
        '''
        self._priors = priors
        LogOddsRatioUninformativeDirichletPrior.__init__(self, alpha_w)

Example #4

	def test_get_p_vals(self):
		tdm = build_hamlet_jz_term_doc_mat()
		df = tdm.get_term_freq_df()
		X = df[['hamlet freq', 'jay-z/r. kelly freq']].values
		pvals = LogOddsRatioUninformativeDirichletPrior().get_p_vals(X)
		# Every p-value must lie in [0, 1]: check the smallest and the largest.
		self.assertGreaterEqual(min(pvals), 0)
		self.assertLessEqual(max(pvals), 1)

Example #5

File: LogOddsUniformativePriorScore.py Project: tanvijain13/CS5590-490-0001-Python-and-Deep-Learning-Programming-

    def get_thresholded_score(cat_word_counts,
                              not_cat_word_counts,
                              alpha_w=0.01,
                              threshold=0.1):
        # Map p-values from [0, 1] onto [-1, 1].
        scores = (LogOddsRatioUninformativeDirichletPrior(alpha_w)
                  .get_p_values_from_counts(cat_word_counts,
                                            not_cat_word_counts)) * 2 - 1
        # scores = (np.min(np.array([1 - scores, scores]), axis=0) <= threshold) * scores
        # Keep scores whose p-value is below threshold or above 1 - threshold; zero the rest.
        return scores * ((scores < -(1. - (threshold * 2)))
                         | (scores > (1. - (threshold * 2))))
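
The thresholding step in Example #5 can be tried in isolation. The sketch below is a pure-Python illustration that takes p-values directly instead of calling scattertext; threshold_scores is a hypothetical name, and the input list is made up for the demonstration.

```python
def threshold_scores(p_values, threshold=0.1):
    """Hypothetical stand-alone version of the thresholding in Example #5."""
    # A p-value p maps to the score 2p - 1, so p < threshold or
    # p > 1 - threshold is the same as |score| > 1 - 2 * threshold.
    cutoff = 1.0 - 2.0 * threshold
    scores = [2.0 * p - 1.0 for p in p_values]
    # Scores inside the cutoff band are zeroed; the others pass through.
    return [s if abs(s) > cutoff else 0.0 for s in scores]


print(threshold_scores([0.5, 0.99, 0.01, 0.7]))
```

Only the terms with p-values 0.99 and 0.01 keep a non-zero score here; the terms at 0.5 and 0.7 fall inside the band and are set to zero.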

Example #6

File: LogOddsUniformativePriorScore.py Project: zluckyhou/scattertext

	def get_score(cat_word_counts, not_cat_word_counts, alpha_w=0.01):
		X = LogOddsUninformativePriorScore. \
			_turn_counts_into_matrix(cat_word_counts, not_cat_word_counts)
		p_vals = LogOddsRatioUninformativeDirichletPrior(alpha_w).get_p_vals(X)
		scores = LogOddsUninformativePriorScore._turn_pvals_into_scores(p_vals)
		return scores

Example #7

File: LogOddsUniformativePriorScore.py Project: zluckyhou/scattertext

	def get_delta_hats(cat_word_counts, not_cat_word_counts, alpha_w=0.01):
		return (LogOddsRatioUninformativeDirichletPrior(alpha_w)
		        .get_log_odds_with_prior(LogOddsUninformativePriorScore
		                                 ._turn_counts_into_matrix(cat_word_counts,
		                                                           not_cat_word_counts)))
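
For readers without scattertext at hand, the quantity behind get_delta_hats and get_p_vals can be sketched in plain Python. This is a minimal sketch, assuming the log-odds-ratio formulation with a uniform Dirichlet prior from Monroe, Colaresi and Quinn (2008) and its commonly used approximate variance; log_odds_uniform_prior is a hypothetical name, and turning the z-score into a value in [0, 1] with the standard normal CDF is an assumption about what the library's p-values represent, not something the excerpts confirm.

```python
import math


def log_odds_uniform_prior(cat_counts, not_cat_counts, alpha_w=0.01):
    """Hypothetical sketch: per-term (delta_hat, z_score, p_value) lists.

    cat_counts and not_cat_counts are aligned per-term counts for the two
    classes. Needs at least two terms so the denominators stay positive.
    """
    alpha_0 = alpha_w * len(cat_counts)   # total prior mass over the vocabulary
    n_i = sum(cat_counts)
    n_j = sum(not_cat_counts)
    delta_hats, z_scores, p_values = [], [], []
    for y_i, y_j in zip(cat_counts, not_cat_counts):
        # Difference of the smoothed log-odds of the term in each class.
        delta = (math.log((y_i + alpha_w) / (n_i + alpha_0 - y_i - alpha_w))
                 - math.log((y_j + alpha_w) / (n_j + alpha_0 - y_j - alpha_w)))
        # Approximate variance of delta (assumed form, see lead-in).
        variance = 1.0 / (y_i + alpha_w) + 1.0 / (y_j + alpha_w)
        z = delta / math.sqrt(variance)
        # Standard normal CDF of the z-score (assumption, see lead-in).
        p = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
        delta_hats.append(delta)
        z_scores.append(z)
        p_values.append(p)
    return delta_hats, z_scores, p_values


deltas, zs, ps = log_odds_uniform_prior([10, 1, 5], [1, 10, 5])
print(deltas, zs, ps)
```

In this toy input the third term is equally frequent in both classes, so its delta is zero and its p-value sits at 0.5, while the first two terms mirror each other with opposite signs.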