Python findGenes示例

编程语言: Python

命名空间/包名称: geneFinder

方法/功能: findGenes

hotexamples.com的示例: 4

Python findGenes - 已找到4个示例。这些是从开源项目中提取的最受好评的geneFinder.findGenes现实Python示例。您可以评价示例，以帮助我们提高示例质量。

示例#1

显示文件

文件： interactionFinder.py 项目： bylin/text-mining

def parseLines(input, entrez, relex, authors):
	'''
	iterates through valid sentences, valid meaning the sentence is a good candidate for being parsed. Uses geneFinder() to find genes and positions of genes within the sentence, then extracts the metainfo we need using extractGenes(). At this step, filter out sentences without enough genes or relations (aka no possible interactions). Also filter out references using an authors database. It's important to process all Unicode characters because not all programs can handle them.
	'''
	for pmid, sentence in parseSentences(input):
		# logging.info("Parsing line: {}".format(sentence[:30] + ' ...... '))
		try:
			decodedSentence = unidecode(sentence.decode('utf-8'))
		except UnicodeDecodeError:
			logging.warning("Can't process Unicode for {}: {}".format(pmid, sentence))
			decodedSentence = sentence
		if isReference(sentence, authors):
			continue
		genesSupport, _ = geneFinder.findGenes(decodedSentence)
		geneIds, geneNames, rawNames = extractGenes(genesSupport, entrez, decodedSentence)
		relations = findRelations(sentence, relex)
		if len(geneNames) < 2 or len(relations) == 0:
			continue
		yield pmid, decodedSentence, geneIds, geneNames, rawNames, relations

示例#2

显示文件

def parseLines(input, entrez, relex, authors):
	'''
	iterates through valid sentences, valid meaning the sentence is a good candidate for being parsed. Uses geneFinder() to find genes and positions of genes within the sentence, then extracts the metainfo we need using extractGenes(). At this step, filter out sentences without enough genes or relations (aka no possible interactions). Also filter out references using an authors database. It's important to process all Unicode characters because not all programs can handle them.
	'''
	for pmid, sentence in parseSentences(input):
		# logging.info("Parsing line: {}".format(sentence[:30] + ' ...... '))
		try:
			decodedSentence = unidecode(sentence.decode('utf-8'))
		except UnicodeDecodeError:
			logging.warning("Can't process Unicode for {}: {}".format(pmid, sentence))
			decodedSentence = sentence
		if isReference(sentence, authors):
			continue
		genesSupport, _ = geneFinder.findGenes(decodedSentence)
		geneIds, geneNames, rawNames = extractGenes(genesSupport, entrez, decodedSentence)
		relations = findRelations(sentence, relex)
		if len(geneNames) < 2 or len(relations) == 0:
			continue
		yield pmid, decodedSentence, geneIds, geneNames, rawNames, relations

示例#3

显示文件

文件： varSearch.py 项目： Moxikai/pubMunch

def findGenes(pmid, text):
    """ return dict of entrezGene id -> mType -> (markerId, list of start, end)
    """
    genes, genePosSet = geneFinder.findGenes(text, pmid)
    return genes, genePosSet

示例#4

显示文件

def findGenes(pmid, text):
    """ return dict of entrezGene id -> mType -> (markerId, list of start, end)
    """
    genes, genePosSet = geneFinder.findGenes(text, pmid)
    return genes, genePosSet