首页python关联规则分析案例

python关联规则分析案例

时间: 2024-10-09 10:13:42 浏览: 79

Python中的关联规则分析通常用于市场篮子分析，比如在零售行业，通过研究顾客购买的商品组合，找出哪些商品经常一起被购买，即所谓的“频繁项集”和“关联规则”。Apriori算法是常用的关联规则挖掘算法，在Python库如`mlxtend`、`frequent_patterns`或`pyfim`中有现成的支持。一个简单的例子可以是分析超市销售数据： ```python from mlxtend.preprocessing import TransactionEncoder from mlxtend.frequent_patterns import apriori, association_rules # 假设我们有如下的购物记录列表 transactions = [['Milk', 'Bread', 'Butter'], ['Eggs', 'Milk', 'Bread'], ['Apples', 'Oranges', 'Juice'], ['Bread', 'Butter', 'Ham'], ['Milk', 'Bread']] # 将交易数据转换为编码形式 te = TransactionEncoder() X = te.fit(transactions).transform(transactions) X = pd.DataFrame.sparse.from_spmatrix(X) # 使用Apriori算法找到频繁项集和关联规则 frequent_itemsets = apriori(X, min_support=0.6, use_colnames=True) rules = association_rules(frequent_itemsets, metric="lift", min_threshold=1.2) # 输出结果，例如支持度大于0.6并且提升度大于1.2的规则 print(rules) ``` 在这个例子中，`min_support`设置了最小的支持度阈值（例如60%），`min_threshold`设置了关联规则的置信度阈值（例如120%）。运行以上代码后，会看到哪些商品组合满足频繁性条件，并且规则强度较高。

阅读全文