训练神经网络时的Epoch vs Iteration

训练多层感知器时，历元和迭代的区别是什么?

当前回答

我认为迭代相当于批SGD中的单批正向+反向。Epoch将遍历整个数据集一次(正如其他人提到的那样)。

2015-06-16 20:55:20

其他回答

根据谷歌的机器学习术语表，一个纪元被定义为

“对整个数据集进行完整的训练，以便每个示例都被看到一次。因此，一个epoch表示N/batch_size训练迭代，其中N是示例的总数。”

如果你正在训练10个epoch的模型，批大小为6，给定总共12个样本，这意味着:

该模型将能够在2次迭代(12 / 6 = 2)即单个epoch中看到整个数据集。总的来说，该模型将有2 X 10 = 20个迭代(每个epoch的迭代X无epoch) 每次迭代后，将对损失和模型参数进行重新评估!

2020-12-30 05:43:21

我想在神经网络术语的背景下:

Epoch:当你的网络最终遍历整个训练集(即，每个训练实例一次)时，它完成了一个Epoch。

为了定义迭代(也就是步骤)，你首先需要知道批处理的大小:

Batch Size: You probably wouldn't like to process the entire training instances all at one forward pass as it is inefficient and needs a huge deal of memory. So what is commonly done is splitting up training instances into subsets (i.e., batches), performing one pass over the selected subset (i.e., batch), and then optimizing the network through backpropagation. The number of training instances within a subset (i.e., batch) is called batch_size. Iteration: (a.k.a training steps) You know that your network has to go over all training instances in one pass in order to complete one epoch. But wait! when you are splitting up your training instances into batches, that means you can only process one batch (a subset of training instances) in one forward pass, so what about the other batches? This is where the term Iteration comes into play: Definition: The number of forwarding passes (The number of batches that you have created) that your network has to do in order to complete one epoch (i.e., going over all training instances) is called Iteration.

例如，当你有10,000个训练实例，你想用10的大小进行批处理;你必须进行10,000/10 = 1,000次迭代才能完成1个epoch。

希望这能回答你的问题!

2020-04-03 17:28:33

Epoch is 1 complete cycle where the Neural network has seen all the data. One might have said 100,000 images to train the model, however, memory space might not be sufficient to process all the images at once, hence we split training the model on smaller chunks of data called batches. e.g. batch size is 100. We need to cover all the images using multiple batches. So we will need 1000 iterations to cover all the 100,000 images. (100 batch size * 1000 iterations) Once Neural Network looks at the entire data it is called 1 Epoch (Point 1). One might need multiple epochs to train the model. (let us say 10 epochs).

2019-09-23 22:58:25

时代对整个数据集进行完整的训练，使得每个例子已经见过一次了。因此，一个epoch表示N/batch 大小训练迭代，其中N是的总数的例子。迭代在训练过程中对模型权重的一次更新。迭代包括计算参数的梯度对于单批数据的损失。

奖金:

批处理在一次迭代中使用的示例集(即一个梯度) 更新)的模型训练。请参见批大小。

来源:https://developers.google.com/machine-learning/glossary/

2019-09-01 16:23:11

要理解它们之间的区别，你必须理解梯度下降算法及其变体。

在我开始回答这个问题之前，我想先了解一下背景。

批处理是完整的数据集。它的大小是可用数据集中训练示例的总数。

小批量大小是学习算法在单次传递(向前和向后)中处理的示例数量。

迷你批是给定迷你批大小的数据集的一小部分。

迭代是算法已经看到的数据批次的数量(或者简单地说，算法已经在数据集上完成的次数)。