Web scraping

A collection of resources dedicated to web scraping: How to do it effectively and better understand possible limitations. Also covers web crawling.

How to Detect Bots and Stop Malicious Attacks?
Web scraping
How to Detect Bots and Stop Malicious Attacks?

Effective bot detection can make the difference between optimal and subpar business performance. In this article, we’ll explore what methods you can use to detect malicious bots.

What Is Browser Fingerprinting & How Does It Work?
Proxies and business
What Is Browser Fingerprinting & How Does It Work?

Explore the basics of browser fingerprints in this easy-to-understand article. Learn how they affect online privacy and security in simple terms.

12 Best Antidetect Browsers for Work and Web Scraping
Web scraping
12 Best Antidetect Browsers for Work and Web Scraping

Looking for the best antidetect browsers for web scraping? Learn how to use these software tools to spoof your browser fingerprints, rotate your IPs, and access any website anonymously.

What is a Headless Browser? A Beginner's Guide
Web scraping
What is a Headless Browser? A Beginner's Guide

Dive into headless browsers, the key to speed and resource efficiency in web processes. Learn to optimize your testing and development tasks effortlessly.

Introduction to Puppeteer: Automating Data Collection
How to
Introduction to Puppeteer: Automating Data Collection

Puppeteer is a powerful Node.js library for automation in Chromium-based browsers — let's take a closer look at how it works and how to set it up for web scraping.

Is web scraping legal?
Web scraping
Is web scraping legal?

Web scraping is a powerful tool — and many data owners want to protect their data. They may even pursue legal action, so you should know web scraping limits.

CSS selectors cheat sheet for web scraping
Web scraping
CSS selectors cheat sheet for web scraping

Learn how to use CSS selectors to style HTML elements with this handy cheat sheet. Includes examples and explanations for all common selectors.

Why you should use residential proxies for web scraping
Proxies and business
Why you should use residential proxies for web scraping

Learn how residential proxies can help you perform web scraping without getting blocked or banned by websites. A guide to avoid blocking, throttling, and captchas by websites.

Using JavaScript and Node.js for web scraping
How to
Using JavaScript and Node.js for web scraping

Learn how to build a web scraper on Node.js with JavaScript in this step-by-step guide. You'll discover how to perform scraping with Node.js and Puppeteer.

lxml crash course
How to
lxml crash course

Learn how to use lxml, the most feature-rich and easy-to-use library for processing XML and HTML in Python. Create, parse, and query XML and HTML documents with lxml.

Web Crawlers Explained
Web scraping
Web Crawlers Explained

Web crawlers are the backbone of every data collection pipeline: Together with web scrapers, they help build products and services. Learn about web crawlers in this guide!

Data Collection: Definition, Methods & Challenges
Web scraping
Data Collection: Definition, Methods & Challenges

Data collection is a vital process that can help you research and gather information efficiently. In this guide, we’ll learn the ins and outs of data collection.

Get In Touch
Have a question about Infatica? Get in touch with our experts to learn how we can help.