X = np.array(df[df.columns[:11]]) X

这是一个将 Pandas DataFrame 对象转换为 NumPy 数组的操作。假设 df 是一个包含葡萄酒数据的 DataFrame 对象，该语句是将 df 中前11列的数据提取出来，并转换为一个 NumPy 的数组 X。该操作可以用于将 Pandas DataFrame 对象转换为适合进行科学计算和机器学习的数据结构。

import numpy as np import pandas as pd def localmin(points, pixel_size): x_min = np.min(points[:, 0]) y_min = np.min(points[:, 1]) x_max = np.max(points[:, 0]) y_max = np.max(points[:, 1]) w = x_max - x_min h = y_max - y_min wn = w // pixel_size + 1 hn = h // pixel_size + 1 x_bins = np.array([i * pixel_size for i in range(int(wn + 1))]) y_bins = np.array([i * pixel_size for i in range(int(hn + 1))]) df = pd.DataFrame(points, columns=['x', 'y', 'z', 'classification', 'indices']) df['x_bin'] = pd.cut(df['x'], bins=x_bins) df['y_bin'] = pd.cut(df['y'], bins=y_bins) result = df.groupby(['x_bin', 'y_bin']).apply(lambda x: x.loc[x['z'].idxmin()])[['x', 'y', 'z', 'classification', 'indices']] return result a = np.random.random([100,3]) b = np.random.random([100, 1])//0.5 c = np.arange(1000).reshape([100,1]) a = np.concatenate([a,b,c],axis=-1) d = localmin(a,0.2) 如何从a中删除d中的点

可以使用pandas的isin()方法来判断a中的点是否出现在d中，然后使用~运算符将不包含在d中的点筛选出来。具体实现代码如下： ```python import numpy as np import pandas as pd def localmin(points, pixel_size): x_min = np.min(points[:, 0]) y_min = np.min(points[:, 1]) x_max = np.max(points[:, 0]) y_max = np.max(points[:, 1]) w = x_max - x_min h = y_max - y_min wn = w // pixel_size + 1 hn = h // pixel_size + 1 x_bins = np.array([i * pixel_size for i in range(int(wn + 1))]) y_bins = np.array([i * pixel_size for i in range(int(hn + 1))]) df = pd.DataFrame(points, columns=['x', 'y', 'z', 'classification', 'indices']) df['x_bin'] = pd.cut(df['x'], bins=x_bins) df['y_bin'] = pd.cut(df['y'], bins=y_bins) result = df.groupby(['x_bin', 'y_bin']).apply(lambda x: x.loc[x['z'].idxmin()])[['x', 'y', 'z', 'classification', 'indices']] return result a = np.random.random([100,3]) b = np.random.random([100, 1])//0.5 c = np.arange(1000).reshape([100,1]) a = np.concatenate([a,b,c],axis=-1) d = localmin(a,0.2) # 筛选出不包含在d中的点并重新赋值给a a = a[~a[:, :2].isin(d.iloc[:, :2].values).any(axis=1)] ``` 其中，isin()方法用于判断一个DataFrame或Series对象中的值是否在另一个DataFrame或Series对象中出现过。在这里我们用isin()方法来判断a中的前两列（即x和y坐标）是否出现在d中，然后使用any()方法将结果沿着列方向合并，得到一个长度为100的布尔型数组，表示a中每个点是否在d中出现过。最后使用~运算符对数组进行取反，得到不包含在d中的点的布尔型数组，然后使用该数组对a进行筛选即可。

df = pd.read_csv('./part-00000-66a9d65e-cad2-4f62-af22-e9acbec50dbc.c000.csv', low_memory=False) sig_cell_volt = np.array(df.iloc[:1000, 10]) sum_volt = list(np.array(df.iloc[:1000, 6])) # print(sig_cell_volt) all_cell_vot = [] for i in sig_cell_volt: i = i[2:] cell_str = i.split('_') cell_list = [] for t in cell_str: t = float(t)/1000 cell_list.append(t) # print(len(cell_list)) all_cell_vot.append(cell_list) all_cell_vot = np.array(all_cell_vot) # x_data = torch.from_numpy(all_cell_vot) print(all_cell_vot) # sing_vol_df = pd.DataFrame(all_cell_vot) # writer = pd.ExcelWriter('vol.xlsx') # sing_vol_df.to_excel(writer,'sheet1',float_format='%.5f') df1 = pd.read_excel('vol.xlsx') col_name=df1.columns.tolist() col_name.insert(95, '总电压') df1['总电压'] = sum_volt df1.to_excel('piggy22.xlsx')

这段代码首先是在上一个代码段的基础上进行的。在处理完数据后，代码将 all_cell_vot 数组中的数据存储到了名为 sing_vol_df 的 DataFrame 中，并将这个 DataFrame 存储到了一个名为 vol.xlsx 的 Excel 文件中。接下来，代码又读取了 vol.xlsx 文件，并将 DataFrame 中的列名存储到了名为 col_name 的列表中。然后，通过 insert 方法在 col_name 列表的索引 95 处插入了一个名为“总电压”的列名。接着，代码将 sum_volt 列表中的数据存储到了新添加的“总电压”列中。最后，通过 to_excel 方法将更新后的 DataFrame 存储到了一个名为 piggy22.xlsx 的 Excel 文件中。

阅读全文

X = np.array(df[df.columns[:11]]) X

相关推荐

df.rank与pivot_table详解：数据排序与透视表操作

Vue.js实战：打造可排序表格组件

BootstrapTable行内编辑实践：x-editable详解与示例

import numpy as np X = np.array([[1, 1], [2, 2], [3, 3]]) #1.转换成DataFrame格式 #2.利用PCA降维 #3.输出降维后的数据 #4.输出载荷 #5.输出方程 F1=0.231*X1+0.597*X2

df = pd.DataFrame(np.array(reviews), columns=['review'])

python ls=[] df6=pd.read_excel("math6.xlsx") df6['t(h)']=0 df6.columns=['0','1','2','3','4','5','6','7'] df61=np.array(df6) #数据类型转换 for i in range(1,14): for j in range(1,7): 试着优化这段代码

大家在看

中子针孔成像点扩展函数模拟研究

华为组播PIM-SM过程总结

HCNP-WLAN-CEWA(H12-321)题库.pdf

汽车电子通信协议SAE J2284

异常处理-mipsCPU简介

最新推荐

掌握HTML/CSS/JS和Node.js的Web应用开发实践

管理建模和仿真的文件

计算机体系结构概述：基础概念与发展趋势

int a[][3]={{1,2},{4}}输出这个数组

勒玛算法研讨会项目：在线商店模拟与Qt界面实现

"互动学习：行动中的多样性与论文攻读经历"

【计算机组成原理精讲】：从零开始深入理解计算机硬件

vue2加载高德地图

Edge语法革新：打造WPF界面新体验

关系数据表示学习

import numpy as np X = np.array([[1, 1], [2, 2], [3, 3]]) #1.转换成DataFrame格式 #2.利用PCA降维 #3.输出降维后的数据 #4.输出载荷 #5.输出方程 F1=0.231X1+0.597X2