def count_reads(self):
    """Record the duplicate-expanded read count for this step's first two outputs.

    Cluster sizes produced by the earlier deduplication step are used to
    expand each representative read back to its full cluster size.
    """
    self.should_count_reads = True
    fasta_pair = self.output_files_local()[0:2]
    cluster_sizes = load_duplicate_cluster_sizes(self.input_cluster_sizes_path())
    self.counts_dict[self.name] = reads_in_group(
        file_group=fasta_pair,
        cluster_sizes=cluster_sizes,
        cluster_key=lambda read_id: read_id)
def _count_reads_work(self, cluster_key, counter_name, fasta_files):
    """Store a read count under *counter_name*, expanding cd-hit-dup clusters.

    Duplicates are included: each representative read contributes its whole
    cluster size, looked up via *cluster_key*.
    """
    self.should_count_reads = True
    cluster_sizes = load_cdhit_cluster_sizes(self.input_cluster_sizes_path())
    self.counts_dict[counter_name] = count.reads_in_group(
        file_group=fasta_files,
        cluster_sizes=cluster_sizes,
        cluster_key=cluster_key)
def count_reads(self):
    """Count reads in the first two outputs and flag apparent subsampling.

    If the observed count exactly equals number-of-files * max_fragments,
    we infer that subsampling truncated the input and record
    counts_dict["subsampled"] = 1.
    """
    self.should_count_reads = True
    counted_files = self.output_files_local()[0:2]
    total = count.reads_in_group(counted_files)
    self.counts_dict[self.name] = total
    # An exact hit on the cap is taken as evidence of truncation.
    cap = self.additional_attributes["max_fragments"] * len(counted_files)
    if total == cap:
        self.counts_dict["subsampled"] = 1
def count_input_reads(input_files, result_dir_local, result_dir_s3, target_name, max_fragments=None):
    """Count reads in the first two input files and upload the counts to S3.

    Writes a JSON dict {target_name: read_count} to "<target_name>.count" in
    result_dir_local, then uploads it to result_dir_s3. When the count is
    exactly len(files) * max_fragments, the input is inferred to have been
    truncated and a "truncated" entry is recorded as well.

    Args:
        input_files: relative file names; only the first two are counted.
        result_dir_local: local directory holding the inputs and count file.
        result_dir_s3: S3 prefix the count file is uploaded under.
        target_name: key the count is stored under; also names the count file.
        max_fragments: optional per-file cap used to detect truncation.
    """
    local_input_files = [os.path.join(result_dir_local, f) for f in input_files[0:2]]
    count_file_basename = "%s.count" % target_name
    local_count_file = "%s/%s" % (result_dir_local, count_file_basename)
    s3_count_file = "%s/%s" % (result_dir_s3, count_file_basename)
    read_count = count.reads_in_group(local_input_files, max_fragments=max_fragments)
    counts_dict = {target_name: read_count}
    # BUGFIX: guard against the default max_fragments=None, which previously
    # raised TypeError on `len(...) * None` whenever no cap was supplied.
    if max_fragments is not None and read_count == len(local_input_files) * max_fragments:
        # If the number of reads is exactly equal to the maximum we specified,
        # it means that the input has been truncated.
        counts_dict["truncated"] = read_count
    with open(local_count_file, 'w') as count_file:
        json.dump(counts_dict, count_file)
    idseq_dag.util.s3.upload_with_retries(local_count_file, s3_count_file)
def count_reads(self):
    """Record the read count of this step's first two output files."""
    self.should_count_reads = True
    output_pair = self.output_files_local()[0:2]
    self.counts_dict[self.name] = count.reads_in_group(output_pair)
def count_reads(self):
    """Count unique reads across this step's fasta outputs.

    Unique (non-expanded) counting is intentional here. The final two
    output files are not fastas and are excluded from the count.
    """
    self.should_count_reads = True
    fasta_outputs = self.output_files_local()[:-2]  # drop the two non-fasta outputs
    self.counts_dict[self.name] = count.reads_in_group(fasta_outputs)
def count_reads(self):
    """Count unidentified reads (the second local output file only)."""
    self.should_count_reads = True
    unidentified_fasta = [self.output_files_local()[1]]
    self.counts_dict["unidentified_fasta"] = count.reads_in_group(unidentified_fasta)