Web scraping

A collection of resources dedicated to web scraping: how to do it effectively and what its limitations are. Also covers web crawling.

What is a Headless Browser? A Beginner's Guide
Web scraping

Dive into headless browsers, the key to speed and resource efficiency in web processes. Learn to optimize your testing and development tasks effortlessly.

Bot Detection 101: How to Identify and Block Malicious Bots
Web scraping

Learn how to detect a bot on your website, app, or API and protect your online business from fraud and security threats. Discover the best bot detection techniques and tools.

Introduction to Puppeteer: Automating Data Collection
How to

Puppeteer is a powerful Node.js library for automation in Chromium-based browsers — let's take a closer look at how it works and how to set it up for web scraping.

Web Scraping with Antidetect Browsers
Web scraping

Learn how to use antidetect browsers for web scraping and bypass anti-bot measures. Compare the best antidetect browsers and their features.

Data Gathering Issues: How to Deal with CAPTCHAs?
Web scraping

CAPTCHA is a powerful tool for distinguishing between human and bot traffic. How does it work — and is it possible to circumvent it? Let's find out.

How to Crawl a Website Without Getting Blocked
Web scraping

Learn how to crawl a site without getting blocked by following these 8 tips. Avoid IP bans, CAPTCHAs, and honeypots with web scraping best practices.

Is web scraping legal?
Web scraping

Web scraping is a powerful tool, and many data owners want to protect their data. They may even pursue legal action, so you should know the legal limits of web scraping.

CSS selectors cheat sheet for web scraping
Web scraping

Learn how to use CSS selectors to target HTML elements with this handy cheat sheet. Includes examples and explanations for all common selectors.

Why you should use residential proxies for web scraping
Proxies and business

Learn how residential proxies can help you perform web scraping without getting blocked or banned by websites. A guide to avoiding blocks, throttling, and CAPTCHAs.

Web scraping with R and rvest
How to

Learn how to use R and rvest to scrape data from any website in this comprehensive tutorial: Inspect HTML elements, write CSS selectors, and store your scraped data in a tidy format.

Browser fingerprinting: How it works and how to avoid it
Proxies and business

In this article, you will learn all the basics about browser fingerprints — and how you can avoid fingerprinting, too.

Using JavaScript and Node.js for web scraping
How to

Learn how to build a web scraper on Node.js with JavaScript in this step-by-step guide. You'll discover how to perform scraping with Node.js and Puppeteer.
