E
The rationale for this is as follows: –log2(p) is the amount of information, in bits, associated with an event of probability p. For example, for an event of probability ½, like flipping a fair coin, –log2(p) is –log2(½) = 1, so there is one bit of information. This should coincide with our intuition of what a bit means (if we have one). If there is a range of possible outcomes with associated probabilities, then to work out the average number of bits, we multiply the number of bits for each outcome (–log2(p)) by its probability p and sum over all the outcomes. This is where the formula comes from.
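To make the averaging concrete, here is a minimal Python sketch; the function name and the example probabilities are our own illustration, not part of this entry:

    import math

    def entropy(probabilities):
        # Average number of bits per outcome: sum of p * -log2(p)
        # over all outcomes (terms with p = 0 contribute nothing).
        return sum(-p * math.log2(p) for p in probabilities if p > 0)

    print(entropy([0.5, 0.5]))   # 1.0 bit: a fair coin
    print(entropy([0.9, 0.1]))   # about 0.469 bits: a biased coin is more predictable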
Entropy is used in the ID3 decision tree induction algorithm.
Error backpropagation learning is often referred to informally as just backprop.
The "point" defined by the current set of weights is termed a point in weight space. Thus weight space is the set of all possible values of the weights.
See also local minimum and gradient descent.
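As an illustrative sketch only (the quadratic error function, the starting weights, and the step size below are assumptions made for this example, not part of this entry), gradient descent can be pictured as repeatedly nudging the current point in weight space downhill on the error surface:

    # The current point in weight space is just the current weight vector.
    weights = [0.5, -1.2]
    learning_rate = 0.1

    def error_gradient(w):
        # Gradient of an illustrative error function E(w) = w1^2 + w2^2.
        return [2 * w[0], 2 * w[1]]

    for _ in range(50):
        grad = error_gradient(weights)
        # Move the point in weight space a small step against the gradient.
        weights = [wi - learning_rate * gi for wi, gi in zip(weights, grad)]

    print(weights)  # very close to the minimum at [0, 0]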
S is the set of instances in a node
k is the number of classes (e.g. 2 if instances are just being classified into two classes: say positive and negative)
N is the number of instances in S
C is the majority class in S
n out of the N examples in S belong to C
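For concreteness, a small Python sketch of how these quantities could be computed for a node; the instance labels and the closing error estimate (N - n) / N are our own illustration, not taken from this entry:

    from collections import Counter

    S = ["pos", "pos", "neg", "pos", "neg"]   # instances in a node, by class label
    k = len(set(S))                            # number of classes (here 2)
    N = len(S)                                 # number of instances in S
    C, n = Counter(S).most_common(1)[0]        # majority class and how many instances belong to it

    print(k, N, C, n)     # 2 5 pos 3
    print((N - n) / N)    # 0.4: error rate if the node simply predicts C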