Tensorflow2 如何扩展现有数据集(缩放、随机旋转、水平翻转、平移等)，从而提高模型的准确率 -- Tensorflow自学笔记14

实际生活中的数据集，往往不是标准的数据，而是有倾斜角度、有旋转、有偏移的数据，为了提高数据集的真实性，提高模型预测的准确率，可以用ImageDataGenerator函数来扩展数据集

import tensorflow as tffrom tensorflow.keras.preprocessing.image import ImageDataGeneratorimage_gen_train = ImageDataGenerator(rescale=1./255, #原像素值 0~255 归至 0~1 rotation_range=45, #随机 45 度旋转width_shift_range=.15, #随机宽度偏移 [-0.15,0.15)height_shift_range=.15,#随机高度偏移 [-0.15,0.15)horizontal_flip=True,#随机水平翻转zoom_range=0.5 #随机缩放到 [1-50%，1+50%]

MNIST数据集增强

import tensorflow as tffrom tensorflow.keras.preprocessing.image import ImageDataGeneratormnist = tf.keras.datasets.mnist(x_train, y_train), (x_test, y_test) = mnist.load_data()x_train, x_test = x_train / 255.0, x_test / 255.0x_train = x_train.reshape(x_train.shape[0], 28, 28, 1) # 数据增强函数的输入要求是 4 维，通过 reshape 调整，给数据增加一个维度,从(60000, 28, 28)reshape为(60000, 28, 28, 1)image_gen_train = ImageDataGenerator(rescale=1. / 1., # 如为图像，分母为255时，可归至0～1rotation_range=45, # 随机45度旋转width_shift_range=.15, # 宽度偏移height_shift_range=.15, # 高度偏移horizontal_flip=False, # 水平翻转zoom_range=0.5 # 将图像随机缩放阈量50％)image_gen_train.fit(x_train)model = tf.keras.models.Sequential([tf.keras.layers.Flatten(),tf.keras.layers.Dense(128, activation='relu'),tf.keras.layers.Dense(10, activation='softmax')])model.compile(optimizer='adam',loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=False),metrics=['sparse_categorical_accuracy'])model.fit(image_gen_train.flow(x_train, y_train, batch_size=32), epochs=5, validation_data=(x_test, y_test),validation_freq=1)model.summary()

数据增强后，图片对比，发现，有的旋转了，有的放大了，有的旋转了。