ResNet18部署实战：Flask WebUI集成详细步骤

1. 背景与应用场景

1.1 通用物体识别的工程价值

在AI落地的众多场景中，通用图像分类是基础且高频的需求。从智能相册自动打标签、电商平台商品识别，到工业巡检中的异常物品检测，背后都离不开一个稳定、轻量、可快速部署的分类模型。

ResNet-18作为深度残差网络（Deep Residual Network）家族中最轻量级的经典结构之一，在精度与速度之间取得了极佳平衡。它在ImageNet上Top-5准确率超过90%，而模型体积仅约44MB（含权重），非常适合边缘设备或CPU环境下的推理服务。

本项目基于TorchVision官方实现的ResNet-18模型，构建了一个完整的本地化Web服务系统，具备以下核心优势：

✅ 内置预训练权重，无需联网加载
✅ 支持1000类常见物体和场景识别（如“alp”、“ski”）
✅ 集成Flask WebUI，支持图片上传与可视化结果展示
✅ CPU优化版本，低资源消耗，毫秒级响应

2. 技术架构设计

2.1 系统整体架构

整个系统采用前后端分离的轻量级架构，主要由三个模块组成：

[用户浏览器] ↓ (HTTP POST) [Flask Web Server] ←→ [PyTorch + ResNet-18 模型] ↓ (HTML 渲染) [返回 Top-3 分类结果]

前端：使用原生HTML+CSS+JavaScript实现简洁交互界面
后端：基于Flask框架接收请求、调用模型推理
模型层：加载TorchVision提供的resnet18(pretrained=True)，并进行推理封装

该架构无需GPU依赖，完全可在普通x86服务器或笔记本电脑上运行。

2.2 关键技术选型对比

组件	可选方案	最终选择	原因
框架	FastAPI / Django / Flask	Flask	轻量、易集成、适合小型Web服务
模型来源	自定义训练 / TorchVision / HuggingFace	TorchVision	官方维护、接口标准、稳定性高
推理设备	GPU / CPU	CPU	成本低、易于部署、ResNet-18对算力要求不高
前端交互	React / Vue / 原生HTML	原生HTML+JS	快速原型、减少依赖

📌为什么选择TorchVision而非HuggingFace？
尽管HuggingFace提供了丰富的模型生态，但对于标准CV任务（如ImageNet分类），TorchVision仍是更稳妥的选择：
直接绑定PyTorch版本，兼容性更好
torchvision.models.resnet18(pretrained=True)一行代码即可加载官方权重
不涉及权限校验或token验证，避免“模型不存在”报错

3. 核心实现步骤

3.1 环境准备与依赖安装

首先创建独立Python环境，并安装必要库：

python -m venv resnet-env source resnet-env/bin/activate # Linux/Mac # 或 resnet-env\Scripts\activate # Windows pip install torch torchvision flask pillow numpy

⚠️ 注意：确保PyTorch版本与torchvision匹配。推荐使用最新稳定版：
bash pip install torch==2.0.1 torchvision==0.15.2

3.2 模型加载与预处理封装

我们将模型加载逻辑封装为一个独立函数，包含必要的图像预处理流程。

import torch import torchvision.models as models from torchvision import transforms from PIL import Image import json # 加载ImageNet类别标签 with open("imagenet_classes.txt", "r") as f: categories = [line.strip() for line in f.readlines()] # 初始化模型（仅一次） model = models.resnet18(weights='IMAGENET1K_V1') # 官方预训练权重 model.eval() # 图像预处理管道 preprocess = transforms.Compose([ transforms.Resize(256), transforms.CenterCrop(224), transforms.ToTensor(), transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]), ]) def predict_image(image_path): image = Image.open(image_path).convert("RGB") input_tensor = preprocess(image) input_batch = input_tensor.unsqueeze(0) # 添加batch维度 with torch.no_grad(): output = model(input_batch) probabilities = torch.nn.functional.softmax(output[0], dim=0) top3_prob, top3_catid = torch.topk(probabilities, 3) results = [] for i in range(top3_prob.size(0)): cat_name = categories[top3_catid[i]].split(",")[0] # 取主名称 score = float(top3_prob[i].cpu().numpy()) results.append({"label": cat_name, "score": round(score * 100, 2)}) return results

📌关键说明： -weights='IMAGENET1K_V1'是当前推荐方式，替代已弃用的pretrained=True-imagenet_classes.txt文件需提前准备好，每行对应一个类别（共1000行） - 使用torch.no_grad()禁用梯度计算以提升推理效率

3.3 Flask Web服务搭建

接下来构建Flask应用，提供/upload接口用于接收图片并返回识别结果。

from flask import Flask, request, render_template, redirect, url_for import os from werkzeug.utils import secure_filename app = Flask(__name__) app.config['UPLOAD_FOLDER'] = 'static/uploads' app.config['MAX_CONTENT_LENGTH'] = 16 * 1024 * 1024 # 限制上传大小为16MB # 确保上传目录存在 os.makedirs(app.config['UPLOAD_FOLDER'], exist_ok=True) @app.route("/", methods=["GET"]) def index(): return render_template("index.html") @app.route("/predict", methods=["POST"]) def predict(): if "file" not in request.files: return redirect(request.url) file = request.files["file"] if file.filename == "": return redirect(request.url) if file: filename = secure_filename(file.filename) filepath = os.path.join(app.config['UPLOAD_FOLDER'], filename) file.save(filepath) try: results = predict_image(filepath) return render_template("result.html", image_url=filepath, results=results) except Exception as e: return f"推理出错: {str(e)}", 500 if __name__ == "__main__": app.run(host="0.0.0.0", port=5000, debug=False)

3.4 前端页面开发

`templates/index.html`

<!DOCTYPE html> <html> <head> <title>👁️ AI万物识别 - ResNet-18</title> <style> body { font-family: Arial; text-align: center; margin-top: 50px; } .upload-box { border: 2px dashed #ccc; padding: 30px; width: 400px; margin: 0 auto; } button { background: #007bff; color: white; padding: 10px 20px; border: none; margin-top: 20px; cursor: pointer; } </style> </head> <body> <h1>👁️ AI 万物识别</h1> <p>上传一张图片，让ResNet-18告诉你它是什么</p> <div class="upload-box"> <form method="POST" action="/predict" enctype="multipart/form-data"> <input type="file" name="file" accept="image/*" required> <br><br> <button type="submit">🔍 开始识别</button> </form> </div> </body> </html>

`templates/result.html`

<!DOCTYPE html> <html> <head> <title>识别结果 - ResNet-18</title> <style> body { font-family: Arial; margin: 40px; } .result { display: flex; gap: 30px; align-items: center; } img { max-width: 300px; border-radius: 8px; } .list { background: #f8f9fa; padding: 20px; border-radius: 8px; } .item { margin: 10px 0; font-size: 1.1em; } .label { font-weight: bold; } .score { color: #007bff; } </style> </head> <body> <a href="/">← 返回上传页</a> <h1>🎯 识别结果</h1> <div class="result"> <img src="{{ image_url }}" alt="Uploaded Image"> <div class="list"> {% for r in results %} <div class="item"> <span class="label">{{ r.label }}</span>: <span class="score">{{ r.score }}%</span> </div> {% endfor %} </div> </div> </body> </html>

4. 性能优化与实践建议

4.1 CPU推理加速技巧

虽然ResNet-18本身较轻，但在生产环境中仍可通过以下手段进一步提升性能：

启用TorchScript编译

python scripted_model = torch.jit.script(model) scripted_model.save("resnet18_scripted.pt")

后续直接加载脚本化模型，减少Python解释开销。

使用ONNX Runtime（可选）

将模型导出为ONNX格式，利用ONNX Runtime进行跨平台高效推理：

python torch.onnx.export(model, input_batch, "resnet18.onnx", opset_version=11)

批处理优化（Batch Inference）

若有并发请求，可累积多个图像合并为batch输入，提高吞吐量。

4.2 实际部署注意事项

问题	解决方案
首次启动慢	缓存模型加载结果，避免重复初始化
内存泄漏	使用Gunicorn + Werkzeug时注意多worker共享模型问题
文件名冲突	使用UUID重命名上传文件
安全风险	限制上传类型（只允许.jpg/.png等）