Python NaiveBayes.MRNaiveBayesTrain Examples

Programming language: Python

Class/type: NaiveBayes

Method/function: MRNaiveBayesTrain

Examples on hotexamples.com: 2

Python NaiveBayes.MRNaiveBayesTrain: 2 examples found. These are the top-rated real-world Python examples of NaiveBayes.MRNaiveBayesTrain from MLfromscratch, extracted from open-source projects.

Example #1

File: NBPredictor.py Project: zsmjoe/school-projects

    def load_args(self, args):
        '''
        Read the data specified by the command-line input.
        '''
        super(MRNaiveBayesTest, self).load_args(args)
        if self.options.continuous_features is not None:
            self.continuous = []
            temp = self.options.continuous_features.split(',')
            for num in temp:
                try:
                    num = int(num)
                except ValueError:
                    self.option_parser.error(
                        "The continuous features number you type in are not integer"
                    )
                self.continuous.append(num)

        # Load the model.
        if self.options.model is None:
            self.option_parser.error("please type the path to the model")
        else:
            # Count of every feature value, per category and feature.
            self.model = {}
            # Total count for each category.
            self.total = {}
            job = NaiveBayes.MRNaiveBayesTrain()
            # `current` is not defined in this excerpt; it is presumably a
            # module-level directory path in the original file.
            with open(current + '/' + self.options.model,
                      encoding='utf-8') as src:
                for line in src:
                    try:
                        # Not an 'all' line: read the count of this feature
                        # value for this category and feature.
                        (cat, feature), (key, num) = job.parse_output_line(
                            line.encode())
                    except (TypeError, ValueError):
                        # An 'all' line: read the total count for the category.
                        (cat, _), num = job.parse_output_line(line.encode())
                        self.total[cat] = num
                        continue
                    if cat not in self.model:
                        # Category not in the model yet: create it.
                        self.model[cat] = {}
                    if feature not in self.model[cat]:
                        # Feature not in model[cat] yet: create it.
                        self.model[cat][feature] = {}
                    self.model[cat][feature][key] = num  # record the count
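
The model-loading loop can be tried in isolation. What follows is a minimal sketch, not the projects' own code. It assumes each model line is a JSON key, a tab, and a JSON value (mrjob's default output protocol), and that per-category totals use "all" as the feature slot. The names `parse_line`, `load_model`, and the sample lines are hypothetical.

```python
import json


def parse_line(line):
    """Hypothetical stand-in for job.parse_output_line: JSON key, tab, JSON value."""
    raw_key, raw_value = line.rstrip("\n").split("\t", 1)
    return json.loads(raw_key), json.loads(raw_value)


def load_model(lines):
    """Build (model, total) from an iterable of model lines."""
    model = {}  # model[cat][feature][key] = count
    total = {}  # total[cat] = count
    for line in lines:
        (cat, feature), value = parse_line(line)
        if isinstance(value, list) and len(value) == 2:
            # Feature line: the value is a [feature_value, count] pair.
            key, num = value
            model.setdefault(cat, {}).setdefault(feature, {})[key] = num
        else:
            # Assumed 'all' line: the value is the category total.
            total[cat] = value
    return model, total


# Made-up sample data in the assumed line format.
sample = [
    '["spam", 0]\t["free", 3]\n',
    '["spam", 0]\t["hello", 1]\n',
    '["spam", "all"]\t4\n',
    '["ham", 0]\t["hello", 5]\n',
    '["ham", "all"]\t5\n',
]
model, total = load_model(sample)
print(model)
print(total)
```

The sketch checks the shape of the value explicitly instead of relying on a failed tuple unpacking, so each line is parsed once.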

Example #2

File: NBPredictor.py Project: hepengfei-ml/MapReduce-Machine-Learning

    def load_args(self, args):
        '''
        Read the data specified by the command-line input.
        '''
        super(MRNaiveBayesTest, self).load_args(args)
        if self.options.continuous_features is not None:
            self.continuous = []
            temp = self.options.continuous_features.split(',')
            for num in temp:
                try:
                    num = int(num)
                except ValueError:
                    self.option_parser.error("The continuous features number you type in are not integer")
                self.continuous.append(num)

        # Load the model.
        if self.options.model is None:
            self.option_parser.error("please type the path to the model")
        else:
            self.model = {}  # count of every feature value, per category and feature
            self.total = {}  # total count for each category
            job = NaiveBayes.MRNaiveBayesTrain()
            # `current` is not defined in this excerpt; it is presumably a
            # module-level directory path in the original file.
            with open(current + '/' + self.options.model, encoding='utf-8') as src:
                for line in src:
                    try:
                        # Not an 'all' line: read the count of this feature
                        # value for this category and feature.
                        (cat, feature), (key, num) = job.parse_output_line(line.encode())
                    except (TypeError, ValueError):
                        # An 'all' line: read the total count for the category.
                        (cat, _), num = job.parse_output_line(line.encode())
                        self.total[cat] = num
                        continue
                    if cat not in self.model:
                        # Category not in the model yet: create it.
                        self.model[cat] = {}
                    if feature not in self.model[cat]:
                        # Feature not in model[cat] yet: create it.
                        self.model[cat][feature] = {}
                    self.model[cat][feature][key] = num  # record the count
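
The first half of `load_args` turns a comma-separated option string into a list of integer feature indices. A minimal standalone sketch of that step follows. The helper name `parse_feature_indices` is hypothetical, and it raises `ValueError` where the original calls `option_parser.error`.

```python
def parse_feature_indices(text):
    """Turn a string such as "0,2,5" into [0, 2, 5]."""
    indices = []
    for token in text.split(","):
        try:
            indices.append(int(token))
        except ValueError:
            # Report the offending token instead of a generic message.
            raise ValueError(
                f"feature index {token!r} is not an integer") from None
    return indices


print(parse_feature_indices("0,2,5"))
```

Catching only `ValueError` keeps unrelated failures, such as a keyboard interrupt, from being reported as bad input.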