Implementing wkmeans in Python so that it returns the SSE

Sure. Here is a Python implementation of wkmeans that records the SSE:

```python
import numpy as np


def euclidean_distance(x1, x2):
    return np.sqrt(np.sum((x1 - x2) ** 2))


class WKMeans:
    def __init__(self, k=2, max_iters=100, plot_steps=False):
        self.k = k
        self.max_iters = max_iters
        self.plot_steps = plot_steps
        # list of sample indices for each cluster
        self.clusters = [[] for _ in range(self.k)]
        # mean feature vector for each cluster
        self.centroids = []
        # SSE (sum of squared errors) for each iteration
        self.sse = []

    def predict(self, X):
        self.X = X
        self.n_samples, self.n_features = X.shape
        # start a fresh SSE history on every call
        self.sse = []

        # initialize centroids from k distinct random samples
        random_sample_idxs = np.random.choice(self.n_samples, self.k, replace=False)
        self.centroids = [self.X[idx] for idx in random_sample_idxs]

        # optimization loop
        for _ in range(self.max_iters):
            # update clusters
            self.clusters = self._create_clusters(self.centroids)
            # no plot() method is defined in this class, so only call one if it exists
            if self.plot_steps and hasattr(self, "plot"):
                self.plot()

            # update centroids
            centroids_old = self.centroids
            self.centroids = self._get_centroids(self.clusters)

            # record SSE before the convergence check so the final iteration is included
            self.sse.append(self._get_sse())

            # check if converged
            if self._is_converged(centroids_old, self.centroids):
                break

        return self._get_cluster_labels(self.clusters)

    def _get_cluster_labels(self, clusters):
        # integer labels; every sample index appears in exactly one cluster
        labels = np.empty(self.n_samples, dtype=int)
        for cluster_idx, cluster in enumerate(clusters):
            for sample_idx in cluster:
                labels[sample_idx] = cluster_idx
        return labels

    def _create_clusters(self, centroids):
        clusters = [[] for _ in range(self.k)]
        for idx, sample in enumerate(self.X):
            centroid_idx = self._closest_centroid(sample, centroids)
            clusters[centroid_idx].append(idx)
        return clusters

    def _closest_centroid(self, sample, centroids):
        distances = [euclidean_distance(sample, point) for point in centroids]
        closest_idx = np.argmin(distances)
        return closest_idx

    def _get_centroids(self, clusters):
        centroids = np.zeros((self.k, self.n_features))
        for cluster_idx, cluster in enumerate(clusters):
            if cluster:
                centroids[cluster_idx] = np.mean(self.X[cluster], axis=0)
            else:
                # empty cluster: keep the previous centroid instead of averaging nothing
                # (self.centroids still holds the previous centroids at this point)
                centroids[cluster_idx] = self.centroids[cluster_idx]
        return centroids

    def _is_converged(self, centroids_old, centroids):
        distances = [euclidean_distance(centroids_old[i], centroids[i]) for i in range(self.k)]
        return sum(distances) == 0

    def _get_sse(self):
        sse = 0.0
        for cluster_idx, cluster in enumerate(self.clusters):
            for sample_idx in cluster:
                # SSE sums squared distances, not plain distances
                sse += euclidean_distance(self.X[sample_idx], self.centroids[cluster_idx]) ** 2
        return sse
```

The code uses NumPy for vector operations and for the Euclidean distance. When constructing a `WKMeans` object you can set the number of clusters `k`, the maximum number of iterations `max_iters`, and `plot_steps`, which is meant to plot the samples in each cluster during iteration; the class defines no `plot` method, so the flag has no effect unless one is added. Calling `predict` with a dataset `X` runs the clustering and returns the cluster label of every sample. The SSE after each iteration is then available through the object's `sse` attribute. Note that, despite the class name, no weights are applied anywhere in this code: as written it is standard k-means.
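The SSE can also be computed in vectorized form from the data, the labels, and the centroids, which is handy for checking the recorded values. The sketch below is not part of the answer above: `compute_sse` and its optional `feature_weights` argument are hypothetical names, and the weighting shown (one non-negative weight per feature, multiplying the squared difference) is only one possible reading of the "w" in wkmeans. With no weights it reduces to the ordinary sum of squared distances.

```python
import numpy as np


def compute_sse(X, labels, centroids, feature_weights=None):
    """Sum of squared distances from each sample to its assigned centroid.

    feature_weights is a hypothetical per-feature weight vector (an assumption,
    not something the original code defines); None means every weight is 1.
    """
    X = np.asarray(X, dtype=float)
    centroids = np.asarray(centroids, dtype=float)
    labels = np.asarray(labels, dtype=int)
    # Squared per-feature difference between each sample and its own centroid.
    sq_diff = (X - centroids[labels]) ** 2
    if feature_weights is not None:
        # Scale each feature's contribution by its weight.
        sq_diff = sq_diff * np.asarray(feature_weights, dtype=float)
    return float(sq_diff.sum())


# Four points in two tight pairs; each point is 1 unit from its centroid.
X = np.array([[0.0, 0.0], [0.0, 2.0], [10.0, 0.0], [10.0, 2.0]])
labels = np.array([0, 0, 1, 1])
centroids = np.array([[0.0, 1.0], [10.0, 1.0]])

plain_sse = compute_sse(X, labels, centroids)
weighted_sse = compute_sse(X, labels, centroids, feature_weights=[1.0, 0.5])
print(plain_sse, weighted_sse)  # prints: 4.0 2.0
```

In this toy example all of the error lies in the second feature, so halving that feature's weight halves the SSE.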
