Interpretable Machine Learning: Understanding Black-Box Models in Depth


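"Interpretable Machine Learning: A Guide for Making Black Box Models Explainable", second edition, by Christoph Molnar, 330-page PDF.

This book examines how to understand and explain machine learning models that are usually treated as "black boxes". The author, Christoph Molnar, is an expert in interpretable machine learning, and the guide offers both insight and practical methods. It opens with short stories that introduce the topic and show readers how machine learning is applied in practice and what problems it can cause. It then covers the basics of machine learning, including what it is and the related terminology, which serves as a useful review for beginners and experienced practitioners alike.

Interpretability is the core of the book. Molnar stresses why it matters, especially in high-stakes domains such as healthcare and finance. He proposes a taxonomy of interpretability methods that covers techniques such as local explanations, global explanations, model simplification, and feature importance. He also discusses the scope of interpretability and how to evaluate how interpretable a model is. The book notes that good explanations should have certain properties, such as comprehensibility, stability, and robustness. Explanations must be meaningful not only to machines but also to human users, because the final decision is often made by a person, so the book emphasizes creating explanations that humans can understand.

In the datasets chapter, the author uses real cases, such as bike rental data (a regression problem) and YouTube spam comment classification (a text classification problem), to show how interpretability techniques apply to different kinds of machine learning tasks. These case studies help readers connect theory with practice.

The book also covers data preprocessing, feature engineering, model selection, and the handling of high-dimensional and unstructured data, all of which are important steps toward an interpretable model. Molnar encourages readers to think about how to keep models transparent and interpretable while pursuing performance, in order to achieve more responsible and reliable machine learning practice. The second edition of Interpretable Machine Learning is a comprehensive and in-depth guide, and a valuable resource for readers who want to understand the inner workings of black-box models and provide explainable predictions. Researchers, data scientists, and business decision makers can all benefit from it.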


Chapter 1 Introduction

her public transport account did not have sufficient tokens. She looked at her smartwatch to check the account balance.

“Login denied. Please contact your Citizens Advice Bureau!” her watch informed her.

A feeling of nausea hit her like a fist to the stomach. She suspected what had happened. To confirm her theory, she started the mobile game “Sniper Guild”, a first-person shooter. The app immediately closed itself again, which confirmed her theory. She became dizzy and sat down on the floor again.

There was only one possible explanation: Her Civic Trust Score had dropped. Substantially. A small drop meant minor inconveniences, such as not getting first class flights or having to wait a little longer for official documents. A low trust score was rare and meant that you were classified as a threat to society. One measure in dealing with these people was to keep them away from public places such as the subway. The government restricted the financial transactions of subjects with low Civic Trust Scores. They also began to actively monitor your behavior on social media and even went as far as to restrict certain content, such as violent games. It became exponentially more difficult to increase your Civic Trust Score the lower it was. People with a very low score usually never recovered.

She could not think of any reason why her score should have fallen. The score was based on machine learning. The Civic Trust Score System worked like a well-oiled engine that ran society. The performance of the Trust Score System was always closely monitored. Machine learning had become much better since the beginning of the century. It had become so efficient that decisions made by the Trust Score System could no longer be disputed. An infallible system.

She laughed in despair. Infallible system. If only. The system rarely failed. But it failed. She must be one of those special cases; an error of the system; from now on an outcast. Nobody dared to question the system. It was too integrated into the government, into society itself, to be questioned. In the few remaining democratic countries it was forbidden to form anti-democratic movements, not because they were inherently malicious, but because they would destabilize the current system. The same logic applied to the now more common algocracies. Criticism of the algorithms was forbidden because of the danger to the status quo.

Algorithmic trust was the fabric of the social order. For the common good, rare false trust scorings were tacitly accepted. Hundreds of other prediction systems and databases fed into the score, making it impossible to know what caused the drop in her score. She felt like a big dark hole was opening in and under her. With horror she looked into the void.

Her tax anity system was eventually integrated into the Civic Trust Score System, but she

never got to know it.

Fermi’s Paperclips

Year 612 AMS (after Mars settlement): A museum on Mars


1.2 What Is Machine Learning?

Machine learning is a set of methods that computers use to make and improve predictions or behaviors based on data.

For example, to predict the value of a house, the computer would learn patterns from past house sales. The book focuses on supervised machine learning, which covers all prediction problems where we have a dataset for which we already know the outcome of interest (e.g. past house prices) and want to learn to predict the outcome for new data. Excluded from supervised learning are, for example, clustering tasks (= unsupervised learning), where we do not have a specific outcome of interest but want to find clusters of data points. Also excluded are things like reinforcement learning, where an agent learns to optimize a certain reward by acting in an environment (e.g. a computer playing Tetris). The goal of supervised learning is to learn a predictive model that maps features of the data (e.g. house size, location, floor type, …) to an output (e.g. house price). If the output is categorical, the task is called classification, and if it is numerical, it is called regression. The machine learning algorithm learns a model by estimating parameters (like weights) or learning structures (like trees). The algorithm is guided by a score or loss function that is minimized. In the house value example, the machine minimizes the difference between the observed house price and the predicted price. A fully trained machine learning model can then be used to make predictions for new instances.
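To make the idea of a loss function concrete, here is a minimal sketch of the house value example in plain Python. It is not taken from the book: the toy data, the one-feature linear model, the function names and the use of gradient descent on the mean squared error are all assumptions chosen for illustration.

```python
# Toy data, invented for illustration: house size (in 100 m^2) and
# sale price (in 100k). The prices follow price = 2 * size + 0.5 exactly.
sizes = [0.5, 0.8, 1.0, 1.2, 1.5]
prices = [1.5, 2.1, 2.5, 2.9, 3.5]


def predict(weight, bias, size):
    """Linear model: the parameters to learn are weight and bias."""
    return weight * size + bias


def mse(weight, bias):
    """Loss: mean squared difference between observed and predicted price."""
    errors = [(predict(weight, bias, x) - y) ** 2 for x, y in zip(sizes, prices)]
    return sum(errors) / len(errors)


def fit(learning_rate=0.1, steps=5000):
    """Estimate the parameters by gradient descent on the loss."""
    weight, bias = 0.0, 0.0
    n = len(sizes)
    for _ in range(steps):
        # Partial derivatives of the mean squared error.
        grad_w = sum(2 * (predict(weight, bias, x) - y) * x
                     for x, y in zip(sizes, prices)) / n
        grad_b = sum(2 * (predict(weight, bias, x) - y)
                     for x, y in zip(sizes, prices)) / n
        # Step against the gradient to reduce the loss.
        weight -= learning_rate * grad_w
        bias -= learning_rate * grad_b
    return weight, bias


weight, bias = fit()
print(f"weight={weight:.3f}, bias={bias:.3f}, loss={mse(weight, bias):.2e}")
# Prediction for a new instance that was not in the training data.
print(f"predicted price for size 1.1: {predict(weight, bias, 1.1):.2f}")
```

Because the toy prices lie exactly on a line, the loss can be driven close to zero here; with real sales data some error would remain.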

Estimation of house prices, product recommendations, street sign detection, credit default prediction and fraud detection: All these examples have in common that they can be solved by machine learning. The tasks are different, but the approach is the same:

Step 1: Data collection. The more, the better. The data must contain the outcome you want to predict and additional information from which to make the prediction. For a street sign detector (“Is there a street sign in the image?”), you would collect street images and label whether a street sign is visible or not. For a credit default predictor, you need past data on actual loans, information on whether the customers were in default with their loans, and data that will help you make predictions, such as income, past credit defaults, and so on. For an automatic house value estimator program, you could collect data from past house sales and information about the real estate such as size, location, and so on.

Step 2: Enter this information into a machine learning algorithm that generates a sign detector model, a credit rating model or a house value estimator.

Step 3: Use the model with new data. Integrate the model into a product or process, such as a self-driving car, a credit application process or a real estate marketplace website.
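The three steps can be sketched for the credit default example as follows. This is an illustration under stated assumptions, not code from the book: the loan records are invented, and the "algorithm" is a deliberately tiny one-split decision rule on income (a decision stump), with hypothetical names `train_stump` and `predict_default`.

```python
# Step 1: data collection. Each record is (yearly income in thousands,
# whether the customer defaulted). The values are invented.
loans = [(18, True), (22, True), (25, True), (31, False), (40, False), (55, False)]


# Step 2: training. Learn the rule "default if income < threshold" by
# choosing the threshold that misclassifies the fewest past loans.
def train_stump(rows):
    incomes = sorted(income for income, _ in rows)
    # Candidate thresholds: midpoints between neighbouring incomes.
    candidates = [(a + b) / 2 for a, b in zip(incomes, incomes[1:])]

    def misclassified(threshold):
        return sum((income < threshold) != defaulted for income, defaulted in rows)

    return min(candidates, key=misclassified)


threshold = train_stump(loans)


# Step 3: use the model with new data, e.g. inside a credit application process.
def predict_default(income):
    return income < threshold


print(f"learned threshold: {threshold}")
print(f"applicant with income 20: default predicted = {predict_default(20)}")
print(f"applicant with income 50: default predicted = {predict_default(50)}")
```

A real credit model would use many more features and records; the point of the sketch is only that the same collect, train, apply pattern holds regardless of the task.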

Machines surpass humans in many tasks, such as playing chess (or more recently Go) or predicting the weather. Even if the machine is as good as a human or a bit worse at a task, there remain great advantages in terms of speed, reproducibility and scaling. Once implemented, a machine learning model can complete a task much faster than humans, reliably delivers consistent results and can be copied infinitely. Replicating a machine learning model on another machine is fast and cheap. The training of a human for a task can take decades (especially when they are young) and is very costly. A major disadvantage of using machine learning is that insights about the data and the task the machine solves are hidden in increasingly complex models. You need millions of numbers to describe a deep neural network, and there is no way
