Merge pull request #751 from mariroz/dev

quick fix of issue #750: mulipage content for politico.com/magazine articles
This commit is contained in:
Nicolas Lœuillet 2014-07-07 21:11:07 +02:00
commit 4247b37551

4
inc/3rdparty/site_config/standard/politico.com.txt vendored Normal file → Executable file
View file

@ -4,10 +4,14 @@ body://div[contains(@class,"story-text")]
# Why doesn't this work? next_page_link://ul[contains(@class,"pagination")]/li/a[@rel="next"]
next_page_link://ul[contains(@class,"pagination")]/li[contains(@class, "current")]/following-sibling::node()/a
next_page_link://div[contains(@class,"pagination")]/ol/li[contains(@class, "current")]/following-sibling::node()/a
date://meta[@name="publish_date"]/@content
strip://div[contains(@class, "breadcrumbs")]
strip://a[contains(@class, "hidden")]
strip://div[contains(@class, "story-embed")]
strip://div[contains(@class, "story-text")]//p/a[contains(text(), "Also on POLITICO:")]/..
strip://div[contains(@class, "story-interrupt")]
strip://footer[contains(@class, "author-bio")]
test_url: http://www.politico.com/news/stories/0712/78105.html