Code example #1
def clean_And_Parse_tweets(tweets):
	# Tokenize each tweet's text into a bag of words.
	for tweet in tweets:
		tweet['tweet_words'] = tweet_tokenizer(tweet['tweet_text'])
	# For tweets that link to a webpage, fetch and tokenize its content too.
	for tweet in tweets:
		if tweet['tweet_urls'] != "":
			webpage = Words_In_Webpage(tweet['tweet_urls'])
			tweet['tweet_webpage_words'] = webcontent_tokenizer(webpage)
	return tweets
Code example #2
def clean_And_Parse_tweets(tweets):
	# Tokenize each tweet's text into a bag of words.
	for tweet in tweets:
		tweet['tweet_words'] = tweet_tokenizer(tweet['tweet_text'])
	# Build a word-count vector for each tweet from the tweet text and,
	# when a URL is present, from the linked webpage as well.
	for tweet in tweets:
		wordvec = {}
		for word in tweet['tweet_words']:
			wordvec[word] = wordvec.get(word, 0) + 1
		if tweet['tweet_urls'] != "":
			webpage = Words_In_Webpage(tweet['tweet_urls'])
			tweet['tweet_webpage_words'] = webcontent_tokenizer(webpage)
			for word in tweet['tweet_webpage_words']:
				wordvec[word] = wordvec.get(word, 0) + 1
		# Copy the counts outside the URL branch so every tweet, with or
		# without a link, ends up with a populated word vector; a relative
		# frequency filter was sketched here but left disabled.
		wordvecfinal = {}
		for word, wordcount in wordvec.items():
			# frac = float(wordcount) / len(wordvec)
			# if frac > 0:
			wordvecfinal[word] = wordcount
		tweet["word_vector"] = wordvecfinal
	return tweets
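
A minimal sketch of how these examples might be driven, assuming each tweet is a dict with 'tweet_text' and 'tweet_urls' keys. The helpers tweet_tokenizer, Words_In_Webpage, and webcontent_tokenizer are not shown in the source, so the stubs below are hypothetical stand-ins just to make the call runnable.

# Hypothetical stand-ins for the helpers the examples assume;
# the real implementations are not shown in the source.
def tweet_tokenizer(text):
	return text.lower().split()

def Words_In_Webpage(url):
	return ""  # a real version would fetch the page at `url`

def webcontent_tokenizer(webpage):
	return webpage.lower().split()

tweets = [
	{'tweet_text': "hello world hello", 'tweet_urls': ""},
	{'tweet_text': "see this link", 'tweet_urls': "http://example.com"},
]
for t in clean_And_Parse_tweets(tweets):
	print(t['tweet_words'], t['word_vector'])
# With these stubs, the first tweet's word_vector would be
# {'hello': 2, 'world': 1}.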