micee posted on 2022-4-2 10:44

Beauty photo scraper

The code is as follows:
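import os
import re

import requests

page = 1
head = {
    "user-agent":"Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/99.0.4844.74 Safari/537.36 Edg/99.0.1150.55"
}
print("1. Sexy beauties  2. Pure and cute  3. Sexy mature ladies  4. Uniform temptation")
ms = input("Select a scraping mode: ").strip()
url1 = "https://dimg.us/xinggan/page/"
url2 = "https://dimg.us/qingchun/page/"
url3 = "https://dimg.us/yujie/page/"
url4 = "https://dimg.us/zhifu/page/"
if ms == "1":
    url = url1
elif ms == "2":
    url = url2
elif ms == "3":
    url = url3
elif ms == "4":
    url = url4
else:
    # Without this branch an invalid choice would leave url undefined
    raise SystemExit("Invalid mode: enter a number from 1 to 4.")
hz = input("Enter the file extension for saved images: ")
bpath = input("Enter the save path: ")
# Compile the pattern once; the named groups capture the image URL and its title
ren = re.compile('<img class="lazyload" data-src="(?P<tu>.*?)" src=".*?" alt="(?P<title>.*?)">', re.S)
while True:
    html = requests.get(url=url + str(page), headers=head)
    # A 404 means we have gone past the last page, so stop before parsing
    if html.status_code == 404:
        break
    for m in ren.finditer(html.text):
        tup = m.group("tu")
        tut = m.group("title")
        ptu = requests.get(tup)
        # os.path.join picks the right separator for the current OS
        path = os.path.join(bpath, tut + "." + hz)
        # The with-block closes the file even if the write fails
        with open(path, "wb") as f:
            f.write(ptu.content)
        print(tup)
    # Report the page that was just processed, then move to the next one
    print(url + str(page))
    page = page + 1

print("Download complete!")
html.close()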
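
The pattern used in the script can be tried offline on a small snippet before pointing it at the live site. This is only a minimal sketch: `extract_images` and `SAMPLE_HTML` are hypothetical names that are not in the original post, the example.com URLs are placeholders, and the sample markup is made up to match what the pattern expects, so the real pages may look different.

```python
import re

# Same pattern as in the post: named group "tu" is the image URL,
# named group "title" is the alt text used as the file name.
IMG_PATTERN = re.compile(
    '<img class="lazyload" data-src="(?P<tu>.*?)" src=".*?" alt="(?P<title>.*?)">',
    re.S,
)


def extract_images(page_html):
    """Return a list of (image_url, title) pairs found in page_html."""
    return [(m.group("tu"), m.group("title")) for m in IMG_PATTERN.finditer(page_html)]


# Hypothetical markup for illustration only; not copied from the real site.
SAMPLE_HTML = (
    '<img class="lazyload" data-src="https://example.com/a.jpg" '
    'src="placeholder.gif" alt="First photo">\n'
    '<img class="lazyload" data-src="https://example.com/b.jpg" '
    'src="placeholder.gif" alt="Second photo">\n'
)

for image_url, title in extract_images(SAMPLE_HTML):
    print(title, image_url)
```

If the site changes the attribute order or adds attributes to the `img` tag, the pattern will silently match nothing, so an empty result on a page that returned 200 is worth checking by hand.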


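The script uses each image title directly as the file name, so a title containing characters such as `/` or `?` could make the save fail, especially on Windows. One possible guard is sketched below; `safe_filename` and `build_path` are hypothetical helpers that are not part of the original script, and replacing bad characters with `_` is just one choice among several.

```python
import os
import re

# Characters Windows does not allow in file names, plus ASCII control characters.
ILLEGAL_CHARS = re.compile(r'[\\/:*?"<>|\x00-\x1f]')


def safe_filename(title, extension, fallback="untitled"):
    """Turn an image title into a file name (hypothetical helper)."""
    # Replace each illegal character, then trim spaces and dots from both ends.
    cleaned = ILLEGAL_CHARS.sub("_", title).strip(" .")
    if not cleaned:
        cleaned = fallback
    # Accept the extension with or without a leading dot.
    return cleaned + "." + extension.lstrip(".")


def build_path(base_dir, title, extension):
    """Join the save directory and the cleaned file name."""
    return os.path.join(base_dir, safe_filename(title, extension))


print(build_path("downloads", "photo 1/2: beach?", "jpg"))
```

Two titles that differ only in illegal characters would map to the same file name and overwrite each other, so adding a counter or the page number to the name may be worth considering.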
Download link for the finished program:
**** Hidden Message *****


yy2386861489 posted on 2022-4-2 10:44

6666

大雪之夜 posted on 2022-4-2 10:56

Thank you @Thanks!

XFDE posted on 2022-4-2 11:04

Thanks for sharing

1913239648 posted on 2022-4-2 11:29

Thanks, boss

梦归雨巷 posted on 2022-4-2 11:33

Thanks, guru

787455958 posted on 2022-4-2 11:34

Just what I needed. Supporting the OP; of all the gurus here, you're the one I'm rooting for!

mlj2003318 posted on 2022-4-2 12:35

Thanks to the OP for sharing

zylz9941 posted on 2022-4-2 12:39

Respect to original creators, and even more to those who repost. Thanks for the hard work

chunjay123 posted on 2022-4-2 14:36

666666