用Python轻松获取王者荣耀英雄皮肤图片：一网打尽！

2023-10-25 16:32:31

使用 Python 从王者荣耀爬取英雄皮肤图片：一步一步指南

前言

王者荣耀，风靡全球的多人在线战斗竞技场（MOBA）手游，以其种类繁多的英雄和引人入胜的皮肤而闻名。对于游戏爱好者来说，收集这些皮肤图片用于壁纸、粉丝艺术或数据分析是一个常见的需求。通过利用 Python 强大的网络抓取功能，我们可以轻松从王者荣耀官方网站下载所有这些图像。

环境设置

在开始本教程之前，请确保你已准备好以下内容：

Python 3.8 或更高版本
PyCharm IDE
已安装 requests 库（使用 pip install requests）

使用 Requests 库抓取图片

Requests 库是一个 Python 库，用于向网站发送 HTTP 请求并获取响应。它使得从网络上获取数据变得非常简单。

1. 获取英雄列表

第一步是获取所有英雄的列表。我们可以使用 requests 库向王者荣耀官方网站的英雄列表页面发送 GET 请求。

import requests

# 发送 GET 请求到英雄列表页面
response = requests.get("https://pvp.qq.com/web201605/js/herolist.json")

# 解析 JSON 响应
hero_list = response.json()

2. 遍历英雄列表

接下来，我们需要遍历英雄列表，为每个英雄抓取其皮肤图片。

for hero in hero_list:
    # 获取英雄名称和英雄 ID
    hero_name = hero["ename"]
    hero_id = hero["ename"].lower()

    # 发送 GET 请求到英雄皮肤页面
    skin_response = requests.get(f"https://game.gtimg.cn/images/yxzj/img201606/skin/hero-info/{hero_id}/10001.jpg")

    # 保存皮肤图片
    with open(f"skins/{hero_name}.jpg", "wb") as f:
        f.write(skin_response.content)

使用正则表达式过滤图片链接

为了提高爬虫的效率和准确性，我们可以使用正则表达式来过滤出皮肤图片链接。

import re

# 正则表达式模式
pattern = r"https://game\.gtimg\.cn/images/yxzj/img201606/skin/hero-info/.*\.jpg"

# 从英雄皮肤页面中提取图片链接
skin_urls = re.findall(pattern, skin_response.text)

# 保存皮肤图片
for url in skin_urls:
    # 获取文件名
    filename = url.split("/")[-1]

    # 发送 GET 请求并保存图片
    skin_response = requests.get(url)
    with open(f"skins/{filename}", "wb") as f:
        f.write(skin_response.content)

示例代码

以下是如何将所有内容放在一起的示例代码：

import requests
import re

# 发送 GET 请求到英雄列表页面
hero_response = requests.get("https://pvp.qq.com/web201605/js/herolist.json")

# 解析 JSON 响应
hero_list = hero_response.json()

# 正则表达式模式
pattern = r"https://game\.gtimg\.cn/images/yxzj/img201606/skin/hero-info/.*\.jpg"

# 遍历英雄列表
for hero in hero_list:
    # 获取英雄名称和英雄 ID
    hero_name = hero["ename"]
    hero_id = hero["ename"].lower()

    # 发送 GET 请求到英雄皮肤页面
    skin_response = requests.get(f"https://game.gtimg.cn/images/yxzj/img201606/skin/hero-info/{hero_id}/10001.jpg")

    # 从页面中提取图片链接
    skin_urls = re.findall(pattern, skin_response.text)

    # 保存皮肤图片
    for url in skin_urls:
        # 获取文件名
        filename = url.split("/")[-1]

        # 发送 GET 请求并保存图片
        skin_response = requests.get(url)
        with open(f"skins/{filename}", "wb") as f:
            f.write(skin_response.content)