Practical Web Scraping for Data Science Best Practices and Examples with Python / by Seppe vanden Broucke, Bart Baesens.

This book provides a complete and modern guide to web scraping, using Python as the programming language, without glossing over important details or best practices. Written with a data science audience in mind, the book explores both scraping and the larger context of web technologies in which it op...

Full description

Saved in:
Bibliographic Details
Main Authors: vanden Broucke, Seppe (Author), Baesens, Bart (Author)
Corporate Author: SpringerLink (Online service)
Format: eBook
Language:English
Published: Berkeley, CA : Apress : Imprint: Apress, 2018.
Edition:1st ed. 2018.
Series:Springer eBook Collection.
Subjects:
Online Access:Click to view e-book
Holy Cross Note:Loaded electronically.
Electronic access restricted to members of the Holy Cross Community.

MARC

LEADER 00000nam a22000005i 4500
001 b3276095
003 MWH
005 20191220131237.0
007 cr nn 008mamaa
008 180418s2018 xxu| s |||| 0|eng d
020 |a 9781484235829 
024 7 |a 10.1007/978-1-4842-3582-9  |2 doi 
035 |a (DE-He213)978-1-4842-3582-9 
050 4 |a E-Book 
072 7 |a UMX  |2 bicssc 
072 7 |a COM051360  |2 bisacsh 
072 7 |a UMX  |2 thema 
100 1 |a vanden Broucke, Seppe.  |e author.  |4 aut  |4 http://id.loc.gov/vocabulary/relators/aut 
245 1 0 |a Practical Web Scraping for Data Science  |h [electronic resource] :  |b Best Practices and Examples with Python /  |c by Seppe vanden Broucke, Bart Baesens. 
250 |a 1st ed. 2018. 
264 1 |a Berkeley, CA :  |b Apress :  |b Imprint: Apress,  |c 2018. 
300 |a XVI, 306 p. 35 illus.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 1 |a Springer eBook Collection 
505 0 |a Part I: Web Scraping Basics -- 1. Introduction -- 2. The Web Speaks HTTP -- 3. Stirring the HTML and CSS Soup -- Part II: Advanced Web Scraping -- 4. Delving Deeper in HTTP -- 5. Dealing with JavaScript -- 6. From Web Scraping to Web Crawling -- Part III: Managerial Concerns and Best Practices -- 7. Managerial and Legal Concerns -- 8. Closing Topics -- 9. Examples. 
520 |a This book provides a complete and modern guide to web scraping, using Python as the programming language, without glossing over important details or best practices. Written with a data science audience in mind, the book explores both scraping and the larger context of web technologies in which it operates, to ensure full understanding. The authors recommend web scraping as a powerful tool for any data scientist’s arsenal, as many data science projects start by obtaining an appropriate data set. Starting with a brief overview on scraping and real-life use cases, the authors explore the core concepts of HTTP, HTML, and CSS to provide a solid foundation. Along with a quick Python primer, they cover requests and Beautiful Soup, Selenium for JavaScript-heavy sites, and web crawling in detail. The book finishes with a recap of best practices and a collection of examples that bring together everything you've learned and illustrate various data science use cases. 
590 |a Loaded electronically. 
590 |a Electronic access restricted to members of the Holy Cross Community. 
650 0 |a Python (Computer program language). 
650 0 |a Database management. 
650 0 |a Big data. 
690 |a Electronic resources (E-books) 
700 1 |a Baesens, Bart.  |e author.  |4 aut  |4 http://id.loc.gov/vocabulary/relators/aut 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
830 0 |a Springer eBook Collection. 
856 4 0 |u https://holycross.idm.oclc.org/login?auth=cas&url=https://doi.org/10.1007/978-1-4842-3582-9  |3 Click to view e-book  |t 0 
907 |a .b32760954  |b 04-18-22  |c 02-26-20 
998 |a he  |b 02-26-20  |c m  |d @   |e -  |f eng  |g xxu  |h 0  |i 1 
912 |a ZDB-2-CWD 
950 |a Professional and Applied Computing (Springer-12059) 
902 |a springer purchased ebooks 
903 |a SEB-COLL 
945 |f  - -   |g 1  |h 0  |j  - -   |k  - -   |l he   |o -  |p $0.00  |q -  |r -  |s b   |t 38  |u 0  |v 0  |w 0  |x 0  |y .i2189257x  |z 02-26-20 
999 f f |i a7c070b5-4bc0-591e-9ce9-471df187c180  |s 214ae5f2-fee8-53be-8ee6-a218a0a59c42  |t 0 
952 f f |p Online  |a College of the Holy Cross  |b Main Campus  |c E-Resources  |d Online  |t 0  |e E-Book  |h Library of Congress classification  |i Elec File