We provide a well-structured database of domain names classified into 80 different categories. Data is processed by a machine learning (ML) engine that scans the websites’ content and retrieves their meta tags. Categories are then assigned to each website using natural language processing (NLP).
Our web crawlers use a combination of advanced ML algorithms and human assistance to parse web pages.
All of the information we provide is well-parsed and normalized to a consistent format for easy integration into business processes. Get both parsed and raw data via a database download in MySQL and comma-separated values (CSV) files.
You can also combine Website Categorization Database with our WHOIS Database Download to obtain WHOIS records containing contact information, registrant details, and more for any domain under all 80 categories.