JAR download: https://github.com/CrawlScript/WebCollector/blob/master/webcollector-2.73-alpha-bin.zip
Usage guide (very detailed): https://blog.csdn.net/wangmx1993328/article/details/81667284?utm_source=blogxgwz0#commentBox
Introductions to web page extraction algorithms:
1. https://blog.csdn.net/dreamzuora/article/details/83623754
2. https://blog.csdn.net/AJAXHu/article/details/48382381