本文主要是介绍HtmlAgilityPack使用(二)【爬取每日一文】,希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!
Program.cs代码:
using HtmlAgilityPack;
using System;
using System.IO;
using System.Linq;namespace MyDemo
{class Program{static void Main(string[] args){for (int i = 0; i < 100; i++){var url = "https://meiriyiwen.com/random/";HtmlWeb web = new HtmlWeb();HtmlDocument doc = web.Load(url);HtmlNode node = doc.DocumentNode.SelectSingleNode("//*[@id=\"article_show\"]");node.Descendants().Where(n => n.Name == "script" || n.Name == "style" || n.Name == "#comment" || n.Id == "bdshare").ToList().ForEach(n => n.Remove());var title = node.SelectSingleNode("//h1").InnerText;var author = node.SelectSingleNode("//*[@class=\"article_author\"]").InnerText;var article = node.SelectSingleNode("//*[@class=\"article_text\"]").InnerText.TrimStart();Console.WriteLine(title + "\r\n");Console.WriteLine(author + "\r\n");Console.WriteLine(article);using (FileStream fs = new FileStream(Environment.CurrentDirectory+@"\Article\" + title + "-" + author + ".txt", FileMode.OpenOrCreate)){StreamWriter stream = new StreamWriter(fs);stream.WriteLine(title + "\r\n");stream.WriteLine(author + "\r\n");stream.WriteLine(article);stream.Close();stream.Dispose();}}}}
}
运行结果如图:
这篇关于HtmlAgilityPack使用(二)【爬取每日一文】的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!