BatchURLScraper - Extracting data using XPath, CSSPath, XQuery and Regex

chaser

chaser

Member
Established Memeber
Hello!

We would like to present BatchURLScraper, a free program for extracting data from web pages using XPath, CSSPath, XQuery and Regex.

[Screenshots: buscr.png, buscr-scrape-rules.png, buscr-debug.png]

BatchURLScraper features:
  • data parsing and extraction from a list of URLs
  • flexible configuration of parsing using XPath, CSSPath, XQuery and Regex extraction methods
  • export of reports in CSV format (opens in Excel)
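
To make the feature list more concrete, here is a rough Python sketch of the general idea: apply a set of extraction rules to a list of pages and write the results out as CSV. It is not how BatchURLScraper works internally. The names PAGES, RULES, extract, scrape and to_csv are made up for illustration, the sample pages are hard-coded instead of downloaded, and Python's standard library only supports a limited XPath subset on well-formed markup, so CSSPath and XQuery are left out.

```python
import csv
import io
import re
import xml.etree.ElementTree as ET

# Hypothetical sample pages standing in for downloaded HTML. They are
# well-formed XHTML so the standard-library XML parser accepts them.
PAGES = {
    "https://example.com/a": "<html><head><title>Page A</title></head>"
                             "<body><h1>Alpha</h1><p>Price: 10 USD</p></body></html>",
    "https://example.com/b": "<html><head><title>Page B</title></head>"
                             "<body><h1>Beta</h1><p>Price: 25 USD</p></body></html>",
}

# Hypothetical rule set: output column -> (method, expression).
# Regex rules are assumed to contain exactly one capturing group.
RULES = {
    "title": ("xpath", ".//title"),
    "h1": ("xpath", ".//h1"),
    "price": ("regex", r"Price:\s*(\d+)"),
}


def extract(html, method, expression):
    """Return the first match for one rule, or an empty string."""
    if method == "xpath":
        node = ET.fromstring(html).find(expression)
        return node.text if node is not None and node.text else ""
    if method == "regex":
        match = re.search(expression, html)
        return match.group(1) if match else ""
    raise ValueError(f"unsupported method: {method}")


def scrape(pages, rules):
    """Apply every rule to every page and collect one row per URL."""
    rows = []
    for url, html in pages.items():
        row = {"url": url}
        for column, (method, expression) in rules.items():
            row[column] = extract(html, method, expression)
        rows.append(row)
    return rows


def to_csv(rows, rules):
    """Serialize the rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["url", *rules])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = scrape(PAGES, RULES)
print(to_csv(rows, RULES))
```

For real-world HTML, which is rarely well-formed, a dedicated HTML parser would be needed in place of ElementTree.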

Download page (5 MB): site-analyzer.pro/soft/batch-url-scraper/

We would be glad to hear any feedback or feature requests for the program.
 
chaser

chaser

Member
Established Memeber
New version: BatchURLScraper 1.3

[Screenshots: get-ga.png, get-templates-counter.png, buscr-settings.png]

What's new:
  • raised the limit on pages to parse from 1000 to 5000 URLs
  • added scraping via HTML templates
  • added extraction of data via CSSPath attributes
  • added scraping of External and Internal HTML
  • added support for lists of proxy servers
  • fixed a bug where the User-Agent was saved incorrectly

Homepage: site-analyzer.pro/soft/batch-url-scraper/
 
chaser

chaser

Member
Established Memeber
New version: BatchURLScraper 1.4

What's new:
  • fixed an error in the validation of HTML templates
  • optimized the handling of regular expressions
  • added the ability to ignore duplicates in scraping results
  • fixed pauses between requests to web pages being applied incorrectly
  • extended the range of pauses between requests to one and a half minutes
  • finalized and improved the translation
  • fixed memory leaks
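
For anyone wondering what ignoring duplicates and pausing between requests look like in practice, here is a small Python sketch of the general technique. It does not show the program's own code. It assumes the pause is picked at random from a configurable range capped at 90 seconds, which is one reading of the note above, and the names unique_rows, pick_pause and polite_fetch_all are made up for illustration.

```python
import random
import time

MAX_PAUSE_SECONDS = 90  # one and a half minutes, as in the 1.4 notes


def unique_rows(rows):
    """Drop repeated result rows while keeping the first-seen order."""
    seen = set()
    result = []
    for row in rows:
        key = tuple(row)
        if key not in seen:
            seen.add(key)
            result.append(row)
    return result


def pick_pause(min_seconds, max_seconds):
    """Pick a random pause within the range, clamped to 0..90 seconds."""
    low = max(0.0, min(min_seconds, MAX_PAUSE_SECONDS))
    high = max(low, min(max_seconds, MAX_PAUSE_SECONDS))
    return random.uniform(low, high)


def polite_fetch_all(urls, fetch, min_seconds=1.0, max_seconds=3.0,
                     sleep=time.sleep):
    """Fetch each URL in turn, pausing between requests.

    `fetch` is a caller-supplied function (hypothetical here) that takes
    a URL and returns one result row as a tuple.
    """
    results = []
    for index, url in enumerate(urls):
        if index > 0:  # no pause before the first request
            sleep(pick_pause(min_seconds, max_seconds))
        results.append(fetch(url))
    return unique_rows(results)
```

Passing a custom sleep function makes it possible to try the loop out without actually waiting, for example by collecting the chosen pauses in a list.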
 
