MNIST手撸教程：浅析CPU和GPU实现差异

人工智能

2024-01-07 10:51:26

前言

手写数字识别一直是深度学习入门经典任务。本文将带领大家动手撸一个MNIST手写数字分类器，并深入探讨CPU和GPU两种实现方式之间的差异。

数据简介

MNIST数据集包含7万张灰度图像，其中6万张为训练集，1万张为测试集。每个图像代表一个手写数字，类别共10个（0~9）。

模型构建

CPU实现

使用Python和TensorFlow构建MNIST分类器：

import tensorflow as tf

# 加载MNIST数据集
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()

# 构建模型
model = tf.keras.models.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28)),
    tf.keras.layers.Dense(128, activation='relu'),
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Dense(10, activation='softmax')
])

# 编译模型
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])

# 训练模型
model.fit(x_train, y_train, epochs=5)

# 评估模型
model.evaluate(x_test, y_test)

GPU实现

在GPU上运行模型，需要使用TensorFlow的GPU版本：

import tensorflow as tf

# 加载MNIST数据集
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()

# 创建GPU设备
device_name = '/device:GPU:0'
with tf.device(device_name):
    # 构建模型
    model = tf.keras.models.Sequential([
        tf.keras.layers.Flatten(input_shape=(28, 28)),
        tf.keras.layers.Dense(128, activation='relu'),
        tf.keras.layers.Dropout(0.2),
        tf.keras.layers.Dense(10, activation='softmax')
    ])

# 编译模型
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])

# 训练模型
model.fit(x_train, y_train, epochs=5)

# 评估模型
model.evaluate(x_test, y_test)