quick fix of issue #750: mulipage content for politico.com/magazine articles

This commit is contained in:
Maryana Rozhankivska 2014-07-07 19:17:55 +03:00
parent 6924253423
commit 82980a148b

4
inc/3rdparty/site_config/standard/politico.com.txt vendored Normal file → Executable file
View file

@ -4,10 +4,14 @@ body://div[contains(@class,"story-text")]
# Why doesn't this work? next_page_link://ul[contains(@class,"pagination")]/li/a[@rel="next"]
next_page_link://ul[contains(@class,"pagination")]/li[contains(@class, "current")]/following-sibling::node()/a
next_page_link://div[contains(@class,"pagination")]/ol/li[contains(@class, "current")]/following-sibling::node()/a
date://meta[@name="publish_date"]/@content
strip://div[contains(@class, "breadcrumbs")]
strip://a[contains(@class, "hidden")]
strip://div[contains(@class, "story-embed")]
strip://div[contains(@class, "story-text")]//p/a[contains(text(), "Also on POLITICO:")]/..
strip://div[contains(@class, "story-interrupt")]
strip://footer[contains(@class, "author-bio")]
test_url: http://www.politico.com/news/stories/0712/78105.html