unlabeled classification algorithms, puMGL contains two iterative processes: (1) selecting subgraph features and (2) deriving a discriminative model. The problem setting of puMGL differs from ours: it assumes that the training datasets contain a small number of positive and a large number of unlabeled multi-graphs, whereas in our work the training datasets contain a small number of labeled multi-graphs for every category and a large number of unlabeled multi-graphs. Thus, the puMGL framework cannot be directly applied to solve our problem.
2.2. Subgraph feature selection
Existing subgraph feature selection algorithms can be divided
into three types: unsupervised algorithms, semi-supervised algo-
rithms and supervised algorithms.
The first type treats frequent subgraphs as subgraph features; for example, the top-$m$ frequent subgraphs are used as subgraph features. The second type first mines frequent subgraphs and then selects subgraph features from them using both labeled and unlabeled graph information. For example, Kong and Yu [3] propose the semi-supervised subgraph feature selection algorithm gSSC. Because gSSC considers not only labeled graphs but also unlabeled graphs when selecting subgraph features, it can derive more valuable subgraph features than other algorithms; meanwhile, an upper bound pruning strategy is proposed to improve its efficiency. The last type mines subgraph features, directly or indirectly, from a large number of labeled graphs. For example, Yan et al. [7] propose a novel mining framework, LEAP (Descending Leap Mine), to identify feature subgraphs. Moreover, two new techniques, structural proximity pruning and frequency-descending mining, are proposed to support leap search in the graph pattern space. In addition, to speed up the mining of subgraph features and to improve their quality, many other techniques have been adopted, such as similarity-based partitioning [8], fast probing based on search history [9], and diversity techniques [10].
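To make the first type concrete, here is a minimal Python sketch of top-$m$ frequent subgraph selection with binary feature vectors. It is illustrative only: `enumerate_subgraphs` and `contains` are hypothetical helpers (in practice a miner such as gSpan would supply them), not part of any cited algorithm.

```python
from collections import Counter

def top_m_frequent_features(graphs, enumerate_subgraphs, contains, m):
    """Unsupervised feature selection: rank candidate subgraphs by
    frequency and keep the m most frequent as features.

    enumerate_subgraphs(g) -> iterable of canonical subgraph codes
    contains(g, sg)        -> True if graph g contains subgraph sg
    (both are assumed helpers, e.g. backed by a gSpan-style miner)
    """
    counts = Counter()
    for g in graphs:
        # Count each candidate once per graph (document frequency).
        counts.update(set(enumerate_subgraphs(g)))
    features = [sg for sg, _ in counts.most_common(m)]

    # Map every graph to a binary feature vector over the selected set.
    vectors = [[1 if contains(g, sg) else 0 for sg in features]
               for g in graphs]
    return features, vectors
```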
Although the first type of algorithms does not require the labels of the graphs in the datasets, experimental results show that its accuracy is far from satisfactory: informative subgraph features may be infrequent as well as frequent, so frequency alone is not an appropriate criterion for selecting subgraph features. Although the second type of algorithms only requires graph datasets containing a small number of labeled graphs, they cannot be directly used to solve the semi-supervised multi-graph classification problem because of the differences between a multi-graph and a graph: compared with a graph, a multi-graph not only maintains the containment relationship between the multi-graph and its member graphs, but also carries mutual constraints between the graph labels and the multi-graph label. The third type of algorithms is likewise unsuitable for directly solving the semi-supervised multi-graph classification problem because it demands training datasets with a large number of labeled multi-graphs. In this paper, we propose a novel subgraph feature selection algorithm that is specially designed for the multi-graph setting and that considers the constraints at both the graph level and the multi-graph level.
2.3. Semi-supervised extreme learning machine
Huang et al. propose ELM for single hidden-layer feedforward neural networks (SLFNs) and then extend it to the “generalized” SLFNs [11–25]. ELM has better generalization performance, faster learning speed, and higher training precision than traditional feedforward neural networks. The ELM algorithm has a wide range of applications, such as protein secondary structure prediction [26], classification in P2P networks [27], XML document classification [28], graph classification [29], and multi-graph classification [30].
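For reference, here is a minimal NumPy sketch of basic ELM training in the standard formulation: the hidden layer is random and fixed, and only the output weights are solved in closed form via the Moore–Penrose pseudo-inverse. Variable names and the sigmoid activation are illustrative choices.

```python
import numpy as np

def elm_train(X, Y, n_hidden, rng=np.random.default_rng(0)):
    """Basic ELM: random input weights, closed-form output weights.
    X: (n_samples, n_features); Y: (n_samples, n_classes), one-hot."""
    W = rng.standard_normal((X.shape[1], n_hidden))  # random input weights
    b = rng.standard_normal(n_hidden)                # random biases
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))           # sigmoid hidden layer
    beta = np.linalg.pinv(H) @ Y                     # Moore-Penrose solution
    return W, b, beta

def elm_predict(X, W, b, beta):
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))
    return H @ beta                                  # class scores
```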
As a promising algorithm, semi-supervised ELM has begun to attract more researchers' attention. Liu et al. [31] introduce the manifold regularization framework into the ELM model to solve the semi-supervised binary classification problem; however, their algorithm is inefficient when the number of hidden neurons is larger than the number of training patterns. Li et al. [32] design a co-training method that repeatedly trains ELMs in a semi-supervised setting, and this strategy of repeated training makes the algorithm inefficient. Huang et al. [33] extend ELMs to semi-supervised tasks based on manifold regularization; the proposed algorithm, SS-ELM (semi-supervised ELM), can handle multi-class classification. Liu et al. [6] propose ESELM (extended semi-supervised ELM), which considers the empirical risk and the structural risk at the same time. ESELM fits both low-dimensional and high-dimensional datasets, and extensive experimental results show its effectiveness. ESELM leans toward the graph structure, while SS-ELM characterizes the general form of the semi-supervised method [6]. To the best of our knowledge, ESELM is the latest semi-supervised ELM algorithm, and we therefore choose ESELM to build our prediction model.
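For orientation only, the manifold-regularization idea shared by these methods can be sketched as follows; this is a generic form (up to constant factors and the exact penalty weighting), not the precise objective of SS-ELM or ESELM. With hidden-layer output matrix $H$, diagonal penalty matrix $C$ (nonzero only for labeled samples), graph Laplacian $L$ of the data adjacency, target matrix $\tilde{Y}$ (labels for labeled samples, zeros otherwise), and trade-off $\lambda$:
\[
\min_{\beta}\; \|\beta\|^2
+ \operatorname{Tr}\!\big((H\beta - \tilde{Y})^{\top} C (H\beta - \tilde{Y})\big)
+ \lambda \operatorname{Tr}\!\big(\beta^{\top} H^{\top} L H \beta\big),
\]
whose closed-form minimizer is
\[
\beta^{*} = \big(I + H^{\top} C H + \lambda H^{\top} L H\big)^{-1} H^{\top} C\, \tilde{Y}.
\]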
3. Problem definition
In this section, we introduce related concepts and formulate our
problem.
Definition 1. (Connected graph)
A graph $G$ is described by a quadruple $\langle V, E, \Sigma, f \rangle$, where $V$ and $E$ denote the sets of vertices and edges, respectively, $\Sigma$ indicates the label range of vertices and edges, and $f$ is a label function which assigns a label to every vertex and edge. A connected graph is a graph in which there exists at least one path between any two vertices. When no ambiguity arises, a connected graph is called a graph for short in this paper. $v_i$ denotes the $i$th vertex of $G$, and $e(v_i, v_j)$ indicates the edge between $v_i$ and $v_j$. The label of vertex $v_i$ is represented as $f(v_i)$, the label of edge $e(v_i, v_j)$ is represented as $f(e(v_i, v_j))$, and $l(G)$ denotes the class label of graph $G$.
Definition 2. (Subgraph)
Graph $G'$ is a subgraph of graph $G$ if $G'$ and $G$ satisfy two conditions: (1) for any vertex $v_i \in V$ of $G'$, $f(v_i) = f(v_j)$, where $v_j \in V$ of $G$; (2) $f(e(v_i, v_m)) = f(e(v_j, v_n))$, where $e(v_i, v_m) \in E$ of $G'$, $e(v_j, v_n) \in E$ of $G$, and $f(v_i) = f(v_j)$, $f(v_m) = f(v_n)$. $G' \subseteq G$ denotes that $G'$ is a subgraph of $G$.
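As an executable illustration of Definition 2, the sketch below tests label-preserving containment $G' \subseteq G$ with networkx's graph matcher. It is one plausible realization, not the paper's implementation; the attribute name "label" is an assumption.

```python
import networkx as nx
from networkx.algorithms import isomorphism

def is_subgraph(g_prime, g):
    """True if g contains a label-preserving embedding of g_prime,
    i.e. g_prime is a subgraph of g in the sense of Definition 2."""
    matcher = isomorphism.GraphMatcher(
        g, g_prime,
        node_match=isomorphism.categorical_node_match("label", None),
        edge_match=isomorphism.categorical_edge_match("label", None))
    return matcher.subgraph_is_monomorphic()

# Example: a labeled triangle contains a labeled edge A-B.
G = nx.Graph()
G.add_nodes_from([(1, {"label": "A"}), (2, {"label": "B"}),
                  (3, {"label": "C"})])
G.add_edges_from([(1, 2, {"label": "x"}), (2, 3, {"label": "y"}),
                  (1, 3, {"label": "z"})])
Gp = nx.Graph()
Gp.add_nodes_from([(1, {"label": "A"}), (2, {"label": "B"})])
Gp.add_edge(1, 2, label="x")
assert is_subgraph(Gp, G)
```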
Definition 3. (Labeled multi-graph)
A multi-graph is a bag of graphs $MG = \{G_1, \ldots, G_i, \ldots, G_{|MG|}\}$ $(1 < i < |MG|)$. A labeled multi-graph is a multi-graph with a binary class label $l(MG) \in \{positive\,(+), negative\,(-)\}$. If the class label of one graph of a multi-graph is tagged as positive, the multi-graph is tagged as positive, i.e., $\exists G \in MG$ and $l(G) = positive \Rightarrow l(MG) = positive$. Otherwise, the multi-graph is tagged as negative, i.e., $\forall G \in MG$ and $l(G) = negative \Rightarrow l(MG) = negative$.
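This is the standard multi-instance labeling rule; as a one-line sketch (names illustrative):

```python
def multigraph_label(graph_labels):
    """A bag is positive iff at least one member graph is positive."""
    return "positive" if any(l == "positive" for l in graph_labels) else "negative"
```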
Definition 4. (Optimal subgraph feature selection)
Given a multi-graph set $MG_S = \{MG_1, \ldots, MG_i, \ldots, MG_{|MG_S|}\}$, $MG_S$'s graph set $G_S = \{G \mid G \in MG_i, MG_i \in MG_S\}$, and $G_S$'s subgraph set $SG = \{Sg \mid Sg \subseteq G, G \in G_S\}$, optimal subgraph feature selection aims to find the most valuable subgraph feature set $F \subseteq SG$, i.e., the set most useful for distinguishing multi-graphs:
\[
F_S = \arg\max_{F} S(F), \quad \text{s.t. } |F| = m, \tag{1}
\]
where $S(F)$ denotes an evaluation criterion that estimates the usefulness of $F$, $|F|$ denotes the cardinality of $F$, and $m$ is the maximum number of selected features.
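Maximizing $S(F)$ over all size-$m$ subsets of $SG$ is combinatorial, so implementations typically approximate Eq. (1); one common simplification (assumed here, not the paper's algorithm) scores each candidate independently and keeps the top $m$, with `score` standing in for $S$:

```python
def select_features(candidates, score, m):
    """Greedy surrogate for Eq. (1): rank candidate subgraphs by an
    individual usefulness score and keep the m highest-scoring ones."""
    return sorted(candidates, key=score, reverse=True)[:m]
```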