df = pd.read_csv('stock_comments_analyzed.csv', parse_dates=['created_time']) grouped = df['polarity'].groupby(df.created_time.dt.date) def BI_Simple_func(row): pos = row[row == 1].count() neg = row[row == 0].count() return (pos-neg)/(pos+neg) BI_Simple_index = grouped.apply(BI_Simple_func) def BI_func(row): pos = row[row == 1].count() neg = row[row == 0].count() bi = np.log(1.0 * (1+pos) / (1+neg)) return bi BI_index = grouped.apply(BI_func) sentiment_idx = pd.concat([BI_index.rename('BI'), BI_Simple_index.rename('BI_Simple')], axis=1) quotes = pd.read_csv('./data/sh000001.csv', parse_dates=['date']) quotes.set_index('date', inplace=True) sentiment_idx.index = pd.to_datetime(sentiment_idx.index) merged = pd.merge(sentiment_idx, quotes, how='left', left_index=True, right_index=True) merged.fillna(method='ffill', inplace=True) merged['BI_MA'] = merged['BI'].rolling(window=10, center=False).mean() merged['BI_Simple_MA'] = merged['BI_Simple'].rolling(window=10, center=False).mean() merged.to_csv('merged_sentiment_idx.csv')

时间: 2024-03-29 19:36:04 浏览: 334

这段代码是关于股票情绪指数的计算和处理，包括读取股票评论数据并按日期分组，定义了两个函数用于计算简单的情绪指数和 BI 指数，然后将这些指数与股票行情数据进行合并并计算移动平均数，最后将结果保存到 csv 文件中。其中，BI 指数是一种用于衡量市场情绪的指标，通过计算买方和卖方之间的力量平衡来反映市场的情绪状态。

result = pd.DataFrame({'第一次打卡时间': first_checkin, '最后一次打卡时间': last_checkin})

As an AI language model, I don't have the context to understand what kind of data is being analyzed here. However, based on the given code, it seems like a pandas DataFrame is being created with two columns: '第一次打卡时间' (first_checkin) and '最后一次打卡时间' (last_checkin). The data for these columns is not provided in the given code.

sp = data_train.target data_train.target1 =np.power(1.5,sp) print(data_train.target1.describe())

This code calculates the target1 variable for the data_train dataset, which is the target variable (sp) raised to the power of 1.5. Then, it uses the describe() method to get descriptive statistics of the target1 variable. The describe() method provides information such as the count, mean, standard deviation, minimum value, 25th percentile, median (50th percentile), 75th percentile, and maximum value of the target1 variable. It is important to note that the interpretation of these descriptive statistics depends on the specific dataset and problem being analyzed.

阅读全文

result = pd.DataFrame({'第一次打卡时间': first_checkin, '最后一次打卡时间': last_checkin})

sp = data_train.target data_train.target1 =np.power(1.5,sp) print(data_train.target1.describe())

相关推荐

"Java_ssm医院门诊挂号系统设计与实现：JSP技术与Mysql数据库应用

Oracle 11.2.0.3自动收集统计信息测试与操作

Lucene入门教程：从基础到实战应用

Alproximate.rar_EEG_The Program_approximate

geao-zeshing.rar_GEAR_bevel_spiral_spiral bevel gear

Path Planning.rar_PSO_Path-Planning_matlab pathplanning_pso path

fddoc.rar_MPC matlab_mpc controller _simulink mpc_statcom

ofdm.rar_convex optimization_ofdm adaptive_ofdm optimization_ofd

mimo.rar_MIMO capacity_mimo_mimo nr antenna_mimo water_water mim

DTC-gor-DFIG.rar_DFIG dtc_DTC ROTOR_between_dtc_dtc for dfig

mppt.rar_MPPT (P&O)_Modified_P&O_PCC_ofdm

compass_使用详解.pdf compass_教程 compass_试用案例

jexcelapi_2_6_4.tar.gz

Experimental_Security_Research_of_Tesla_Autopilot.pdf

StatFacts_Conditional_Value_at_Risk.pdf

【sre_parse错误诊断】：快速定位问题，掌握sre_parse调试与故障排除技巧

lab_pred = classifier.classifier(size, radius, samples, x1, 0.6, 0.05, num=3) length = len(set(lab_pred)) 解析

openpyxl.load_workbook(r'C:\同步空间\python\try12\123.xlsx')

大家在看

海思芯片规格对比.pdf

C#线上考试系统源码.zip

polsarpro官方教程、操作说明 PolSARpro v5.0 Software Training Course

人工智能技术在数值天气预报中的应用.zip

WRF model前处理.md

最新推荐

用python做一个搜索引擎(Pylucene)的实例代码

掌握HTML/CSS/JS和Node.js的Web应用开发实践

管理建模和仿真的文件

计算机体系结构概述：基础概念与发展趋势

int a[][3]={{1,2},{4}}输出这个数组

勒玛算法研讨会项目：在线商店模拟与Qt界面实现

"互动学习：行动中的多样性与论文攻读经历"

【计算机组成原理精讲】：从零开始深入理解计算机硬件

vue2加载高德地图

Edge语法革新：打造WPF界面新体验