Python AgglomerativeClustering.labels_ Examples

Programming language: Python

Namespace/package name: sklearn.cluster

Method/function: labels_

Examples at hotexamples.com: 2

Python AgglomerativeClustering.labels_ - 2 examples found. These are the top-rated real-world Python examples of sklearn.cluster.AgglomerativeClustering.labels_ extracted from open source projects.
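
For orientation before the project examples, here is a minimal sketch of what labels_ contains after fitting. The data points and the choice of n_clusters=2 are made up for illustration and do not come from either project.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering

# Two well-separated groups of 2-D points (made-up data for illustration).
X = np.array([[0.0, 0.0], [0.1, 0.2], [0.2, 0.1],
              [5.0, 5.0], [5.1, 5.2], [5.2, 5.1]])

model = AgglomerativeClustering(n_clusters=2, linkage="ward")
model.fit(X)

# labels_ holds one integer cluster id per input row, in input order.
labels = model.labels_
print(labels.shape)               # one entry per row of X
print(len(set(labels.tolist())))  # number of distinct cluster ids
```

The ids themselves are arbitrary; only the grouping they induce is meaningful.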

Example #1

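import numpy as np
from sklearn.cluster import AgglomerativeClustering

# makeRepresentationMatrix, getOutliers, createClusterMap and nameClusters
# are helpers from the source project and are not shown in this excerpt.


def agglomerate(corpus, threshold=1.4, ignoreOutliers=True):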
    """
    Cluster a set of questions using the hierarchical (bottom-up) agglomerative clustering method.

    Parameters:
        corpus (list): Tagged Questions Corpus collection of questions and their ids
        threshold (float): Distance threshold for the cut-off point as we build the dendrogram
            (default is 1.4)
        ignoreOutliers (bool): A flag to determine whether to remove outliers or not
            (default is True)

    Returns:
        corpus: Corpus that has clusters list attached to it
    """

    repMatrix = makeRepresentationMatrix(corpus)

    outliers = []
    if ignoreOutliers:
        outliers, repMatrix = getOutliers(repMatrix, corpus=corpus)

    clustering = AgglomerativeClustering(linkage="ward",
                                         distance_threshold=threshold,
                                         n_clusters=None)

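    clustering.fit(repMatrix)
    # np.insert takes indices relative to the array before insertion, so each
    # outlier position is shifted left by the number of outliers preceding it
    # (this assumes `outliers` is sorted in ascending order).
    mapping = [i[1] - i[0] for i in enumerate(outliers)]
    # Re-insert the outliers into the label array with the label -1.
    clustering.labels_ = np.insert(clustering.labels_, mapping, -1)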
    clusterMap = createClusterMap(corpus, clustering)
    print(clusterMap)  #JEFFLAG
    corpus = nameClusters(clusterMap)
    print("The corpus's clusters after naming")
    print(corpus.clusters)  #JEFFLAG
    return corpus
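
The index arithmetic used to put the outliers back into labels_ is easy to misread, so here is a small standalone sketch of that step using NumPy only. The label values and outlier positions are made up for illustration, and the sketch assumes, as the example above appears to, that outliers holds the original row positions in ascending order.

```python
import numpy as np

# Labels returned for the 4 rows that were kept after outlier removal
# (made-up values for illustration).
kept_labels = np.array([0, 1, 1, 0])

# Positions of the removed rows in the ORIGINAL 6-row input, sorted ascending.
outliers = [1, 4]

# np.insert interprets every index relative to the array before any
# insertion, so each original position is shifted left by the number of
# outliers that precede it.
mapping = [pos - rank for rank, pos in enumerate(outliers)]

# Outliers get the label -1 at their original positions.
full_labels = np.insert(kept_labels, mapping, -1)
print(full_labels.tolist())  # [0, -1, 1, 1, -1, 0]
```

After the insertion, full_labels has one entry per original row again, so it can be aligned with the original corpus by position.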

Example #2

File: clustering.py Project: usc-psychsim/atomic

import numpy as np
from sklearn.cluster import AgglomerativeClustering


def update_clusters(clustering: AgglomerativeClustering,
                    new_distance_threshold: float):
    """
    Updates the cluster labels for each datapoint to be consistent with the algorithm's hierarchy and given distance
    threshold. Useful when we already ran the HAC algorithm to determine the points' hierarchy but want to change the
    threshold at which the number of clusters is found.
    :param AgglomerativeClustering clustering: the clustering algorithm with the distances
    :param float new_distance_threshold: the new distance threshold at which the number of clusters is to be determined.
    :return:
    """
    clustering.distance_threshold = new_distance_threshold
    # reset every label to -1 before the labels are recomputed
    clustering.labels_ = np.full_like(clustering.labels_, -1, dtype=int)
    # _update_clusters is a helper from the source project (not shown here)
    _update_clusters(clustering)
    # invert to follow natural order
    clustering.labels_ = np.max(clustering.labels_) - clustering.labels_
    clustering.n_clusters_ = int(np.max(clustering.labels_) + 1)
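
The _update_clusters helper is not included in the excerpt, so the following is only a sketch of one way to recompute labels for a new threshold from an already fitted model, not the project's implementation. The function name relabel_at_threshold and the sample data are hypothetical. It relies on the children_ and distances_ attributes that scikit-learn populates when distance_threshold is set, and it assumes a linkage whose merge distances never decrease along the tree (for example ward, complete, or average).

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering


def relabel_at_threshold(model, threshold):
    """Return labels obtained by cutting a fitted tree at `threshold`.

    Hypothetical helper: applies every merge whose distance is below the
    threshold and leaves the fitted model itself unchanged.
    """
    n_samples = len(model.labels_)
    # Union-find over the leaves (0..n-1) and the merge nodes (n..2n-2).
    parent = list(range(2 * n_samples - 1))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    # Row `step` of children_ describes the merge that creates node n + step.
    for step, (left, right) in enumerate(model.children_):
        if model.distances_[step] < threshold:
            node = n_samples + step
            parent[find(left)] = node
            parent[find(right)] = node

    roots = [find(i) for i in range(n_samples)]
    # Map the root ids to consecutive labels 0..k-1.
    _, labels = np.unique(roots, return_inverse=True)
    return labels


# Made-up 1-D data: three tight pairs, two of them close to each other.
X = np.array([[0.0], [0.1], [1.0], [1.1], [10.0], [10.1]])
model = AgglomerativeClustering(n_clusters=None, distance_threshold=0,
                                linkage="complete")
model.fit(X)  # distance_threshold=0 builds the full tree with distances_

coarse = relabel_at_threshold(model, threshold=5.0)
fine = relabel_at_threshold(model, threshold=0.5)
print(len(set(coarse.tolist())), len(set(fine.tolist())))
```

Returning a new array instead of overwriting labels_ is a design choice of this sketch; it lets several thresholds be compared against a single fit.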