Example code for scraping《纯阳剑尊》from 顶点小说网 with Python

自学编程网 Python

2020-10-16

Scraping《纯阳剑尊》from 顶点小说网

Code

import requests
from bs4 import BeautifulSoup

# Send a browser-like User-Agent so the site is less likely to reject the request
headers = {
  'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 '
                '(KHTML, like Gecko) Chrome/70.0.3538.102 Safari/537.36'
}

# Fetch a page and return its HTML as text
def open_url(url):
  response = requests.get(url, headers=headers, timeout=10)
  # Decode with the encoding detected from the body so Chinese text is not garbled
  response.encoding = response.apparent_encoding
  html = response.text
  return html

# Extract the chapter title (the <h1> inside the first <dd> element)
def get_title(html):
  soup = BeautifulSoup(html, 'lxml')
  title_tag = soup.find('dd')
  title = '\n' + title_tag.h1.get_text() + '\n'
  return title

# Extract the chapter body (every <dd id="contents"> element)
def get_texts(html):
  soup2 = BeautifulSoup(html, 'lxml')
  text_tags = soup2.find_all('dd', id="contents")
  return text_tags

# Append the title to the output file
def save_title(filename, title):
  with open(filename, 'a+', encoding='utf-8') as file:
    file.write(title)

# Append the chapter text to the output file
def save_text(filename, text):
  with open(filename, 'a+', encoding='utf-8') as file:
    file.write(text)

# Main program
def main():
  num = input('《纯阳剑尊》: which chapter do you want to download? (1-802) ')
  num = int(num)
  # Chapter pages are numbered consecutively, starting right after 8184027
  number = 8184027 + num
  url = 'https://www.23us.so/files/article/html/15/15905/' + str(number) + '.html'
  filename = '纯阳剑尊.txt'
  r = open_url(url)
  title = get_title(r)
  tags = get_texts(r)
  save_title(filename, title)
  for text_tag in tags:
    text = text_tag.get_text() + '\n'
    save_text(filename, text)
  print('Chapter {} has been downloaded!'.format(num))

if __name__ == '__main__':
  main()
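
The script turns a chapter number into a page URL by adding it to 8184027, and it trusts whatever the user types. The following is a minimal sketch of that mapping with range checking, not part of the original article. The names `BASE_URL`, `FIRST_PAGE_OFFSET`, `LAST_CHAPTER`, `chapter_url` and `chapter_urls` are hypothetical helpers introduced here for illustration. It also keeps the script's assumption that page ids are consecutive across all 802 chapters.

```python
# Constants taken from the script above; the names themselves are illustrative.
BASE_URL = 'https://www.23us.so/files/article/html/15/15905/'
FIRST_PAGE_OFFSET = 8184027   # page id = offset + chapter number
LAST_CHAPTER = 802            # upper bound shown in the input prompt


def chapter_url(num):
    """Return the page URL for chapter `num`; raise ValueError if out of range."""
    if not 1 <= num <= LAST_CHAPTER:
        raise ValueError('chapter must be between 1 and {}'.format(LAST_CHAPTER))
    return '{}{}.html'.format(BASE_URL, FIRST_PAGE_OFFSET + num)


def chapter_urls(first, last):
    """Return (chapter number, URL) pairs for chapters first..last inclusive."""
    if first > last:
        raise ValueError('first chapter must not exceed last chapter')
    return [(num, chapter_url(num)) for num in range(first, last + 1)]
```

With a helper like this, `main()` could call `chapter_url(num)` instead of concatenating strings, and a loop over `chapter_urls(1, 10)` would give the URLs for a batch of chapters. If the site's page ids are not actually consecutive, the mapping would need to be read from the book's index page instead.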

Result:

[Screenshots of the scraping result are not reproduced here.]
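
Text taken from a page with `get_text()` often carries non-breaking spaces (from `&nbsp;` indentation) and blank lines. Whether this site's pages do so is an assumption, since the article does not show the raw output. A small normaliser along these lines could be applied before `save_text`. The function name `clean_chapter_text` is hypothetical.

```python
def clean_chapter_text(raw):
    """Normalise extracted chapter text to one non-empty paragraph per line."""
    paragraphs = []
    # Replace non-breaking spaces with ordinary spaces, then trim each line.
    for line in raw.replace('\xa0', ' ').splitlines():
        line = line.strip()
        if line:                      # drop empty lines
            paragraphs.append(line)
    if not paragraphs:
        return ''
    return '\n'.join(paragraphs) + '\n'
```

In the script above this would be used as `save_text(filename, clean_chapter_text(text_tag.get_text()))`.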

Source: 遇见资源网, Python, "Example code for scraping《纯阳剑尊》from 顶点小说网 with Python", http://www.ox520.com/27379.html
