Webbots, spiders, and screen scrapers : a guide to developing Internet agents with PHP/CURL / Michael Schrenk.

"Webbots, Spiders, and Screen Scrapers will show you how to create simple programs with PHP/CURL to mine, parse, and archive online data to help you make informed decisions."--Overview from publisher's website.

Bibliographic Details
Main Author: Schrenk, Michael
Format: eBook
Language: English
Published: San Francisco : No Starch Press, ©2012.
Edition: 2nd ed.
Table of Contents:
  • pt. 1. Fundamental concepts and techniques
  • What's in it for you?
  • Ideas for webbot projects
  • Downloading web pages
  • Basic parsing techniques
  • Advanced parsing with regular expressions
  • Automating form submission
  • Managing large amounts of data
  • pt. 2. Projects
  • Price-monitoring webbots
  • Image-capturing webbots
  • Link-verification webbots
  • Search-ranking webbots
  • Aggregation webbots
  • FTP webbots
  • Webbots that read email
  • Webbots that send email
  • Converting a website into a function
  • pt. 3. Advanced technical considerations
  • Spiders
  • Procurement webbots and snipers
  • Webbots and cryptography
  • Authentication
  • Advanced cookie management
  • Scheduling webbots and spiders
  • Scraping difficult websites with browser macros
  • Hacking iMacros
  • Deployment and scaling
  • pt. 4. Larger considerations
  • Designing stealthy webbots and spiders
  • Proxies
  • Writing fault-tolerant webbots
  • Designing webbot-friendly websites
  • Killing spiders
  • Keeping webbots out of trouble.