Ejemplos de AgglomerativeClustering.labels_ en Python

Lenguaje de programación: Python

Namespace/Package Name: sklearn.cluster

Método / Función: labels_

Ejemplos en hotexamples.com: 2

Python AgglomerativeClustering.labels_ - 2 ejemplos encontrados. Estos son los ejemplos en Python del mundo real mejor valorados de sklearn.cluster.AgglomerativeClustering.labels_ extraídos de proyectos de código abierto. Puedes valorar ejemplos para ayudarnos a mejorar la calidad de los ejemplos.

Métodos usados con frecuencia

Mostrar Ocultar

AgglomerativeClustering(30)

fit(30)

predict(27)

fit_predict(26)

get_params(7)

set_params(5)

n_clusters(5)

tolist(2)

reshape(2)

__init__(2)

labels_(2)

connectivity(2)

compute_full_tree(2)

eval_model(1)

train_model(1)

train(1)

affinity(1)

to_pickle(1)

to_csv(1)

sort_values(1)

apply(1)

score(1)

resize(1)

n_clusters_(1)

drop(1)

copy(1)

model_name(1)

merge(1)

linkage(1)

distance_threshold(1)

labels(1)

isnull(1)

index(1)

groupby(1)

distances_(1)

value_counts(1)

Ejemplo n.º 1

Mostrar archivo

def agglomerate(corpus, threshold=1.4, ignoreOutliers=True):
    """
    Cluster a set of questions using the hierarchical (bottom-up) agglomerative clustering method.

    Parameters:
        corpus (list): Tagged Questions Corpus collection of questions and their ids
        threshold (int): Interger value to determine the distance threshold for the cut-off point as we build the dendrogram
        removeOutliers (bool): A flag to determine whether to remove outliers or not
            (default is True)

    Returns:
        corpus: Corpus that has clusters list attached to it
    """

    repMatrix = makeRepresentationMatrix(corpus)

    outliers = []
    if ignoreOutliers:
        outliers, repMatrix = getOutliers(repMatrix, corpus=corpus)

    clustering = AgglomerativeClustering(linkage="ward",
                                         distance_threshold=threshold,
                                         n_clusters=None)

    clustering.fit(repMatrix)
    mapping = [i[1] - i[0] for i in enumerate(outliers)]
    clustering.labels_ = np.insert(clustering.labels_, mapping, -1)
    clusterMap = createClusterMap(corpus, clustering)
    print(clusterMap)  #JEFFLAG
    corpus = nameClusters(clusterMap)
    print("The corpus's clusters after naming")
    print(corpus.clusters)  #JEFFLAG
    return corpus

Ejemplo n.º 2

Mostrar archivo

Archivo: clustering.py Proyecto: usc-psychsim/atomic

def update_clusters(clustering: AgglomerativeClustering,
                    new_distance_threshold: float):
    """
    Updates the cluster labels for each datapoint to be consistent with the algorithm's hierarchy and given distance
    threshold. Useful when we already ran the HAC algorithm to determine the points' hierarchy but want to change the
    threshold at which the number of clusters is found.
    :param AgglomerativeClustering clustering: the clustering algorithm with the distances
    :param float new_distance_threshold: the new distance threshold at which the number of clusters is to be determined.
    :return:
    """
    clustering.distance_threshold = new_distance_threshold
    clustering.labels_ = np.full_like(clustering.labels_, -1, dtype=int)
    _update_clusters(clustering)
    clustering.labels_ = np.max(
        clustering.labels_
    ) - clustering.labels_  # invert to follow natural order
    clustering.n_clusters_ = int(np.max(clustering.labels_) + 1)