Getting Structured Data From the Internet: Running Web Crawlers/Scrapers on a Big Data Production Scale
/Jay M. Patel
- 1st edition
- New York Apress 2023
- xix, 397 p. : ill. ; 25 cm.
Includes index
This book teaches readers to write Python scripts that crawl websites at scale, scrape data from HTML and JavaScript-enabled pages, and either convert it into structured formats such as CSV, Excel, or JSON or load it into a SQL database of their choice.