Python OrthoMCLCluster Examples

Programming language: Python

Namespace/package name: orthomcl

Class/type: OrthoMCLCluster

Examples on hotexamples.com: 6

Python OrthoMCLCluster - 6 examples found. These are the top-rated real-world Python examples of orthomcl.OrthoMCLCluster, extracted from open source projects.

Frequently used methods

OrthoMCLCluster(3)

get_name(3)

get_gene_hash(2)

add_gene(1)

get_count(1)

get_species_hash(1)

to_s(1)
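
The page does not show the class itself, so the following is only a minimal sketch of what an object with the methods listed above might look like. It is written in Python 3 (the examples on this page are Python 2). The name ClusterSketch, the regular expression, the return value of to_s, and the assumed input format (an OrthoMCL-style line such as "NAME(N genes,M taxa):" followed by gene(Species) entries) are all assumptions for illustration, not the actual orthomcl.OrthoMCLCluster implementation.

```python
import re

# Assumed member syntax: "geneid(Species)" entries separated by whitespace.
_MEMBER = re.compile(r"([^\s()]+)\(([^\s()]+)\)")


class ClusterSketch:
    """Hypothetical stand-in for orthomcl.OrthoMCLCluster (not the real class)."""

    def __init__(self, line):
        # Assumed layout: "NAME(N genes,M taxa):<TAB> gene1(Sp1) gene2(Sp2) ..."
        header, _, members = line.partition(":")
        # The cluster name is whatever precedes the "(N genes,M taxa)" part.
        self.name = header.split("(")[0].strip()
        # Maps gene id -> species, in the order the genes appear on the line.
        self.gene_hash = {}
        for gene_id, species in _MEMBER.findall(members):
            self.gene_hash[gene_id] = species

    def get_name(self):
        return self.name

    def get_gene_hash(self):
        return self.gene_hash

    def get_count(self):
        return len(self.gene_hash)

    def add_gene(self, gene_id, species):
        self.gene_hash[gene_id] = species

    def get_species_hash(self):
        # Maps species -> list of gene ids belonging to that species.
        by_species = {}
        for gene_id, species in self.gene_hash.items():
            by_species.setdefault(species, []).append(gene_id)
        return by_species

    def to_s(self):
        # Assumption: serialize back to the same one-line layout as the input.
        members = " ".join(f"{g}({s})" for g, s in self.gene_hash.items())
        taxa = len(set(self.gene_hash.values()))
        return f"{self.name}({self.get_count()} genes,{taxa} taxa):\t {members}"
```

With a sketch like this, the call patterns in the examples (get_name, get_gene_hash, add_gene followed by get_count) can be tried on a single line of cluster output without the original module.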

Example #1

File: geneid2cluster.py Project: lierhan/bioinformatics

def main():
  inFile = plausi()
  fo = open(inFile)
  for line in fo:
    o = OrthoMCLCluster(line.rstrip())
    name = o.get_name()
    geneHash = o.get_gene_hash()
    for geneid, species in geneHash.iteritems(): print geneid + "\t" + name

Example #2

File: geneid2cluster.py Project: sjjose2/bioinformatics

def main():
    inFile = plausi()
    fo = open(inFile)
    for line in fo:
        o = OrthoMCLCluster(line.rstrip())
        name = o.get_name()
        geneHash = o.get_gene_hash()
        for geneid, species in geneHash.iteritems():
            print geneid + "\t" + name

Example #3

def main():
    args = plausi()
    in_orthomcl = args[0]
    EVALUE = float('1e-20')
    IDENTITY = 30.0
    if len(args) == 4:
        in_fasta, in_gg, in_blast = args[1:4]
        gene2species, speciesArray = read_gg(in_gg)
        gene2length = get_seq_lengths(in_fasta)
        dbmfile = in_blast + ".add.dbm"
        dbm = anydbm.open(dbmfile, "c")
        fo = open(in_blast)
        for line in fo:
            line = line.rstrip()
            cols = line.split("\t")
            qid, hid, evalue, identity = cols[0], cols[1], float(
                cols[10]), float(cols[2])
            # ignore self-hits and between-species hits, check e-value and identity thresholds
            if qid == hid: continue
            if gene2species[qid] != gene2species[hid]: continue
            if evalue > EVALUE: continue
            if identity < IDENTITY: continue
            # check that blast alignment spans at least 80% of the longer sequence
            alnlength, qlength, hlength = int(
                cols[3]), gene2length[qid], gene2length[hid]
            lengthcutoff = 0.80 * max([qlength, hlength])
            if alnlength < lengthcutoff: continue
            if not dbm.has_key(qid): dbm[qid] = ""
            else: dbm[qid] += " "
            dbm[qid] += hid
        fo.close()
        dbm.close()
    else:
        dbmfile = args[1]
    dbm = anydbm.open(dbmfile)

    fo = open(in_orthomcl)
    for line in fo:
        o = OrthoMCLCluster(line.rstrip())
        oldsize = o.get_count()
        additions = []
        for geneid, species in o.get_gene_hash().iteritems():
            if not dbm.has_key(geneid): continue
            [additions.append([x, species]) for x in dbm[geneid].split()]

        for x, species in additions:
            o.add_gene(x, species)
        o.to_s()
        newsize = o.get_count()
        print >> sys.stderr, "%s\t%s\t%s" % (o.get_name(), oldsize, newsize)
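
The filtering step in the example above can be isolated into a small function, which makes the thresholds easier to see and to test. This is a hedged Python 3 sketch, not the project's code: the function names keep_hit and collect_within_species_hits are hypothetical, an in-memory dict replaces the anydbm file, and the column positions (0 query, 1 hit, 2 percent identity, 3 alignment length, 10 e-value) are taken from the indices used in the example.

```python
def keep_hit(cols, gene2species, gene2length,
             max_evalue=1e-20, min_identity=30.0, min_coverage=0.80):
    """Return True if one tab-split BLAST row passes the example's filters."""
    qid, hid = cols[0], cols[1]
    identity, aln_length, evalue = float(cols[2]), int(cols[3]), float(cols[10])
    if qid == hid:
        return False  # self-hit
    if gene2species[qid] != gene2species[hid]:
        return False  # hit between different species
    if evalue > max_evalue or identity < min_identity:
        return False  # too weak
    # The alignment must cover min_coverage of the longer of the two sequences.
    longer = max(gene2length[qid], gene2length[hid])
    return aln_length >= min_coverage * longer


def collect_within_species_hits(lines, gene2species, gene2length):
    """Map each query id to the list of hit ids that pass keep_hit."""
    hits = {}
    for line in lines:
        cols = line.rstrip("\n").split("\t")
        if keep_hit(cols, gene2species, gene2length):
            hits.setdefault(cols[0], []).append(cols[1])
    return hits
```

Keeping the hits as a list per query avoids the string concatenation used with the dbm values; a persistent store would still be needed if the BLAST file is too large to hold in memory.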

Example #4

File: add-blasthits-to-cluster.py Project: lierhan/bioinformatics

def main():
  args = plausi()
  in_orthomcl = args[0]
  EVALUE = float('1e-20')
  IDENTITY = 30.0
  if len(args) == 4:
    in_fasta, in_gg, in_blast = args[1:4]
    gene2species, speciesArray = read_gg(in_gg)
    gene2length = get_seq_lengths(in_fasta)
    dbmfile = in_blast + ".add.dbm"
    dbm = anydbm.open(dbmfile, "c")
    fo = open(in_blast)
    for line in fo: 
      line = line.rstrip()
      cols = line.split("\t")
      qid, hid, evalue, identity = cols[0], cols[1], float(cols[10]), float(cols[2])
      # ignore self-hits and between-species hits, check e-value and identity thresholds
      if qid == hid: continue
      if gene2species[qid] != gene2species[hid]: continue
      if evalue > EVALUE: continue
      if identity < IDENTITY: continue
      # check that blast alignment spans at least 80% of the longer sequence
      alnlength, qlength, hlength = int(cols[3]), gene2length[qid], gene2length[hid]
      lengthcutoff = 0.80 * max([qlength, hlength])
      if alnlength < lengthcutoff: continue
      if not dbm.has_key(qid): dbm[qid] = ""
      else: dbm[qid] += " "
      dbm[qid] += hid
    fo.close()
    dbm.close()
  else: dbmfile = args[1]
  dbm = anydbm.open(dbmfile)

  fo = open(in_orthomcl)
  for line in fo:
    o = OrthoMCLCluster(line.rstrip())
    oldsize = o.get_count()
    additions = []
    for geneid, species in o.get_gene_hash().iteritems():
      if not dbm.has_key(geneid): continue
      [additions.append([x, species]) for x in dbm[geneid].split()]

    for x, species in additions: o.add_gene(x,species)
    o.to_s()
    newsize = o.get_count()
    print >> sys.stderr, "%s\t%s\t%s" %(o.get_name(), oldsize, newsize)

Example #5

File: cluster2arath.py Project: lierhan/bioinformatics

def main():
  inFile = plausi()
  fo = open(inFile)
  for line in fo:
    o = OrthoMCLCluster(line.rstrip())
    print o.get_name() + "\t" + o.get_species_hash()['Arath'][0]

Example #6

File: cluster2arath.py Project: sjjose2/bioinformatics

def main():
    inFile = plausi()
    fo = open(inFile)
    for line in fo:
        o = OrthoMCLCluster(line.rstrip())
        print o.get_name() + "\t" + o.get_species_hash()['Arath'][0]
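
The lookup get_species_hash()['Arath'][0] in the two cluster2arath.py examples assumes every cluster contains at least one Arath gene. If get_species_hash returns a plain dict of species to gene-id lists (an assumption inferred from how it is indexed here), a cluster without that species would raise KeyError. A small Python 3 sketch of a more forgiving lookup, using the hypothetical helper name first_gene:

```python
def first_gene(species_hash, species, default="NA"):
    """Return the first gene id listed for a species, or default if none exists."""
    genes = species_hash.get(species)
    return genes[0] if genes else default
```

Whether to print a placeholder or skip such clusters entirely depends on what the downstream table expects.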