Python get_alphas 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: family.utilities

메소드/함수: get_alphas

hotexamples.com에서의 예제들: 4

Python get_alphas - 4개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 family.utilities.get_alphas에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: trio_model.py 프로젝트: maip/novo-muta

    def seq_err(self, member):
        """
        Calculate the probability of sequencing error. Assume each chromosome
        is equally-likely to be sequenced.

        The probability is drawn from a Dirichlet multinomial distribution:
        This is a point of divergence from the Cartwright et al. paper
        mentioned in the other functions.

        When the Dirichlet multinomial is called, the max element is stored in
        max_elems, so that the scaling of the probability matrix can be
        manipulated later.

        Args:
            member: Integer representing index of the read counts for a
                family member in the trio model.

        Returns:
            1 x 16 probability vector that needs to be multiplied by a
            transition matrix.
        """
        # TODO: add bias when alpha freq are added
        alpha_mat = ut.get_alphas(self.seq_err_rate) * self.dm_disp

        prob_mat = np.zeros((ut.GENOTYPE_COUNT))
        for i, alpha in enumerate(alpha_mat):
            log_proba = ut.dirichlet_multinomial(alpha, self.reads[member])
            prob_mat[i] = log_proba

        prob_mat_rescaled, max_elem = ut.normalspace(prob_mat)
        self.max_elems.append(max_elem)

        return prob_mat_rescaled

예제 #2

파일 보기

파일: trio_model.py 프로젝트: reedacartwright/novo-muta

    def seq_err(self, member):
        """
        Calculate the probability of sequencing error. Assume each chromosome
        is equally-likely to be sequenced.

        The probability is drawn from a Dirichlet multinomial distribution:
        This is a point of divergence from the Cartwright et al. paper
        mentioned in the other functions.

        When the Dirichlet multinomial is called, the max element is stored in
        max_elems, so that the scaling of the probability matrix can be
        manipulated later.

        Args:
            member: Integer representing index of the read counts for a
                family member in the trio model.

        Returns:
            1 x 16 probability vector that needs to be multiplied by a
            transition matrix.
        """
        # TODO: add bias when alpha freq are added
        alpha_mat = ut.get_alphas(self.seq_err_rate) * self.dm_disp

        prob_mat = np.zeros((ut.GENOTYPE_COUNT))
        for i, alpha in enumerate(alpha_mat):
            log_proba = ut.dirichlet_multinomial(alpha, self.reads[member])
            prob_mat[i] = log_proba

        prob_mat_rescaled, max_elem = ut.normalspace(prob_mat)
        self.max_elems.append(max_elem)

        return prob_mat_rescaled

예제 #3

파일 보기

파일: simulation_model.py 프로젝트: reedacartwright/novo-muta

    def dm_sample(self, soma_idx):
        """
        Use alpha frequencies based on the somatic genotype to select
        nucleotide frequencies and use these frequencies to draw sequencing
        reads at a specified coverage (Dirichlet multinomial).

        Args:
            soma_idx: Index of somatic genotype to get the appropriate alpha
            frequencies.

        Returns:
            Array containing read counts [#A, #C, #G, #T].
        """
        alpha_mat = (ut.get_alphas(self.trio_model.seq_err_rate) *
            self.trio_model.dm_disp)
        alpha = np.random.dirichlet(alpha_mat[soma_idx])
        return np.random.multinomial(self.cov, alpha)

예제 #4

파일 보기

    def dm_sample(self, soma_idx):
        """
        Use alpha frequencies based on the somatic genotype to select
        nucleotide frequencies and use these frequencies to draw sequencing
        reads at a specified coverage (Dirichlet multinomial).

        Args:
            soma_idx: Index of somatic genotype to get the appropriate alpha
            frequencies.

        Returns:
            Array containing read counts [#A, #C, #G, #T].
        """
        alpha_mat = (ut.get_alphas(self.trio_model.seq_err_rate) *
                     self.trio_model.dm_disp)
        alpha = np.random.dirichlet(alpha_mat[soma_idx])
        return np.random.multinomial(self.cov, alpha)