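A general-purpose website content crawler that can batch-scrape novels, forum content, and more from any site and save the results as TXT documents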
Thanks for the suggestion. Great work!
I don't think it's a good idea to add Turndown to this project. This script is for novel sites, and most of them are crammed with advertisements. If I convert the content with full Markdown support, obfuscation in the output will be inevitable.
Thank you.
Sometimes I do have to manually edit the Markdown files produced by Turndown, due to JavaScript and CSS that were caught in the process.
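One possible way to cut down on that manual cleanup is to strip script and style blocks from the HTML before it reaches the converter. The snippet below is only a rough sketch: `stripScriptsAndStyles` is a hypothetical helper name, not part of either script or of Turndown, and the regex approach assumes reasonably well-formed markup.

```javascript
// Hypothetical helper (not from either script): removes <script>, <style>,
// and <noscript> elements, including their contents, plus HTML comments,
// from an HTML string before it is handed to a Markdown converter.
function stripScriptsAndStyles(html) {
  return html
    // Match an opening tag, everything up to the matching closing tag, and the closing tag.
    .replace(/<(script|style|noscript)\b[^>]*>[\s\S]*?<\/\1\s*>/gi, '')
    // Drop HTML comments, which sometimes carry ad markers or inline code.
    .replace(/<!--[\s\S]*?-->/g, '');
}

// Made-up sample input for illustration only.
const sample =
  '<p>Chapter 1</p><script>var ad = 1;</script>' +
  '<style>.ad{color:red}</style><p>Text</p>';

const cleaned = stripScriptsAndStyles(sample);
console.log(cleaned); // <p>Chapter 1</p><p>Text</p>
```

In a userscript, working on the parsed DOM (for example, removing the matching elements from a cloned node) is likely more robust than a regex. Turndown also documents a `remove` method for dropping elements by tag name, which may be simpler if Turndown is already in use.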
The HTML seems to work as expected, most of the time, though I should improve it.
Add Markdown support.
Please take a look at my script, which supports plain text, Markdown, and HTML.