K-means code in Python

Below is a basic implementation of the K-means algorithm in Python:

```python
import random

import matplotlib.pyplot as plt
import numpy as np


def euclidean_distance(x1, x2):
    """Euclidean distance between two vectors."""
    return np.sqrt(np.sum((x1 - x2) ** 2))


class KMeans:
    def __init__(self, K=2, max_iters=100, plot_steps=False):
        self.K = K
        self.max_iters = max_iters
        self.plot_steps = plot_steps
        # Cluster centers
        self.centroids = []
        # Sample indices assigned to each cluster
        self.clusters = [[] for _ in range(self.K)]

    def predict(self, X):
        self.X = X
        self.n_samples, self.n_features = X.shape

        # Pick K random samples as the initial cluster centers
        random_sample_idxs = random.sample(range(self.n_samples), self.K)
        self.centroids = np.array([self.X[idx] for idx in random_sample_idxs])

        # Main loop
        for _ in range(self.max_iters):
            # Assignment step: attach every sample to its nearest center
            self.clusters = self._create_clusters(self.centroids)
            if self.plot_steps:
                self._plot()

            # Update step: move every center to the mean of its cluster
            old_centroids = self.centroids
            self.centroids = self._get_centroids(self.clusters)

            # Stop once the centers no longer change
            if self._is_converged(old_centroids, self.centroids):
                break

            if self.plot_steps:
                self._plot()

        # Return the cluster index of every sample
        return self._get_cluster_labels(self.clusters)

    def _create_clusters(self, centroids):
        clusters = [[] for _ in range(self.K)]
        for idx, sample in enumerate(self.X):
            centroid_idx = self._closest_centroid(sample, centroids)
            clusters[centroid_idx].append(idx)
        return clusters

    def _closest_centroid(self, sample, centroids):
        distances = [euclidean_distance(sample, point) for point in centroids]
        return np.argmin(distances)

    def _get_centroids(self, clusters):
        centroids = np.zeros((self.K, self.n_features))
        for idx, cluster in enumerate(clusters):
            if cluster:
                centroids[idx] = np.mean(self.X[cluster], axis=0)
            else:
                # Empty cluster: keep the previous center rather than
                # averaging zero points, which would produce NaN
                centroids[idx] = self.centroids[idx]
        return centroids

    def _is_converged(self, old_centroids, centroids):
        distances = [
            euclidean_distance(old_centroids[i], centroids[i])
            for i in range(self.K)
        ]
        return sum(distances) == 0

    def _get_cluster_labels(self, clusters):
        # Integer dtype so the labels can be used as indices and color codes
        labels = np.empty(self.n_samples, dtype=int)
        for cluster_idx, cluster in enumerate(clusters):
            for sample_idx in cluster:
                labels[sample_idx] = cluster_idx
        return labels

    def _plot(self):
        # Scatter plot of the current clusters and centers (2-D data)
        fig, ax = plt.subplots(figsize=(12, 8))
        for i, index in enumerate(self.clusters):
            point = self.X[index].T
            ax.scatter(*point)
        for point in self.centroids:
            ax.scatter(*point, marker="x", color="black", linewidth=2)
        plt.show()
```

Usage example:

```python
from sklearn.datasets import make_blobs
import matplotlib.pyplot as plt

X, y = make_blobs(centers=3, n_samples=500, random_state=42)

kmeans = KMeans(K=3, max_iters=150, plot_steps=True)
y_pred = kmeans.predict(X)

fig, ax = plt.subplots(figsize=(12, 8))
for i in range(len(X)):
    # 'C0', 'C1', ... are colors from Matplotlib's default color cycle
    ax.scatter(X[i][0], X[i][1], color='C{}'.format(y_pred[i]))
plt.show()
```
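The assignment step in `_create_clusters` and `_closest_centroid` loops over samples one at a time in Python. As a rough sketch of an alternative, the same nearest-center lookup can be done for all samples at once with NumPy broadcasting. The helper name `assign_clusters` and the toy data below are assumptions made for illustration and are not part of the class above.

```python
import numpy as np


def assign_clusters(X, centroids):
    """Return the index of the nearest centroid for every row of X."""
    # Broadcasting: (n_samples, 1, n_features) - (1, K, n_features)
    diffs = X[:, np.newaxis, :] - centroids[np.newaxis, :, :]
    # Squared Euclidean distances, shape (n_samples, K). The square root
    # is skipped because it does not change which centroid is closest.
    sq_dists = np.sum(diffs ** 2, axis=2)
    return np.argmin(sq_dists, axis=1)


# Toy data (hypothetical): two points near the origin, two near (5, 5)
X = np.array([[0.0, 0.0], [0.1, 0.2], [5.0, 5.0], [5.2, 4.9]])
centroids = np.array([[0.0, 0.0], [5.0, 5.0]])

labels = assign_clusters(X, centroids)
print(labels)  # [0 0 1 1]
```

This version returns one label per sample rather than a list of index lists per cluster, so it would replace `_get_cluster_labels` as well if it were adopted. It also builds an intermediate array of shape `(n_samples, K, n_features)`, which can be a memory concern on large datasets.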
