Python FormatEval.stripTags Examples

Programming Language: Python

Namespace/Package Name: formatEval

Class/Type: FormatEval

Method/Function: stripTags

Examples at hotexamples.com: 1

Python FormatEval.stripTags - 1 examples found. These are the top rated real world Python examples of formatEval.FormatEval.stripTags extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

getShuffledCorpus(2)

copy_files_for_eval(1)

getBiblFromDir(1)

getBiblList(1)

get_list_of_tag_from_dir(1)

stripTags(1)

strip_tags(1)

Example #1

Show file

File: partition.py Project: ansdma/bilbo

	def createEvaluationfiles(self, dirCorpus, testPercentage, numberOfPartition, allBibl):
		dirPartitions = self.getDirPartitionNames()
		for dirPartition in dirPartitions:
			(annotateDir, testDir, trainDir, modelDir, _) = self.getDirTestNames(dirPartition)
			testCorpus, trainCorpus = FormatEval.getShuffledCorpus(allBibl, testPercentage)
			
			trainFile = os.path.join(trainDir, 'train.xml')
			self.saveListToFile(trainCorpus, trainFile)
			
			cleanCorpus = FormatEval.stripTags(testCorpus)
			cleanFile = os.path.join(annotateDir, 'test_clean.xml')
			self.saveListToFile(cleanCorpus, cleanFile)

			# In test.xml we need to duplicate <bibl> inside <bibl>, in order to present the same data for evaluation
			# Bilbo does not format the "same" data equaly between train and annotation
			evalFile = os.path.join(testDir, 'test.xml')
			testCorpus = FormatEval.getBiblList("\n".join(testCorpus))
			self.saveListToFile(testCorpus, evalFile)