PromptLoop helps you automatically research thousands of companies to find exactly what you need, whether that is qualifying customers by market segment, monitoring their hiring pages, checking what software they use, or analyzing their services. Get accurate data in minutes instead of spending hours on manual research.
Web browsing tasks let you build AI systems that extract specific data from websites. You define the items you are looking for, and our models handle the rest: you input a website and get back a row of data corresponding to the search items you defined in the task. You can then run these tasks on a CSV or Excel file with thousands of inputs (websites). It's as simple as defining what you are looking for and uploading a file.
Advanced options let you define the exact format you need your data in, so you can build a proprietary dataset from scratch in minutes.
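As a minimal sketch of preparing an upload file, the Python below normalizes and de-duplicates a list of websites and writes them to a one-column CSV. The column header `website`, the helper names, and the choice to keep only the host and force `https` are assumptions made for illustration. They are not PromptLoop requirements, so use whatever input column your task expects.

```python
import csv
import io
from urllib.parse import urlparse


def normalize_website(raw: str) -> str:
    """Return a lowercase https URL (host only) for a domain or URL, or '' if unusable."""
    value = raw.strip()
    if not value:
        return ""
    # urlparse only fills in netloc when a scheme is present.
    if "://" not in value:
        value = "https://" + value
    host = urlparse(value).netloc.lower()
    return f"https://{host}" if "." in host else ""


def build_input_csv(websites: list[str], column: str = "website") -> str:
    """Write unique, normalized websites to CSV text with a single column."""
    seen: set[str] = set()
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow([column])
    for raw in websites:
        url = normalize_website(raw)
        if url and url not in seen:
            seen.add(url)
            writer.writerow([url])
    return buffer.getvalue()


csv_text = build_input_csv(["Example.com", "https://example.com/about", "", "example.org"])
print(csv_text)
```

Here the first two inputs collapse into a single `https://example.com` row and the empty entry is dropped, so the file contains a header plus two websites.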
PromptLoop has three types of web browsing tasks:
Crawl Task - This takes a single website or domain and finds new datapoints (new columns) for it.
Search Task - This takes a search term and returns the matching link based on search criteria. This is a helpful starting point if you do not have a list of websites yet and need to find appropriate web resources.
List Task (Legacy) - This takes a single website or domain and returns multiple rows with multiple datapoints for it. For example, it can pull out all companies from a customer page.
To start, navigate to the tasks tab and click New Task. The New Task option will open a popup like below where you can describe what you are looking for. You should put in the data points you are interested in and any special formatting instructions.

PromptLoop will generate columns for your task based on what you asked for and you can preview and accept the columns.
With the query "I want the description of the law firm, their number and email and whether or not they offer personal injury services", PromptLoop generated the columns with the proper search queries to help guide the model through the website. All of these are editable after creating the task, but starting from generated columns saves you a lot of time.

Or take the query "I want the description of the law firm, I also want a list of the managing partners at the firm (first, last, title and email)". You will see that the generated task includes list columns for the managing partners. This means that for each managing partner found on the website, PromptLoop will return a row of data with the first name, last name, title, and email. The static columns (description) will be the same for each of those rows.
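To make the row shape concrete, here is a small Python sketch of how static columns and list columns could combine into rows. The function name, the column keys, and the sample values are invented for illustration. The handling of an empty list (one row with only the static columns) is also an assumption, not documented behavior.

```python
def expand_rows(static: dict[str, str], items: list[dict[str, str]]) -> list[dict[str, str]]:
    """Return one row per list item, each carrying a copy of the static columns."""
    if not items:
        # Assumption: with no list items, a single row of static columns remains.
        return [dict(static)]
    return [{**static, **item} for item in items]


# Made-up sample values, not real results.
static_columns = {"description": "Regional law firm focused on personal injury."}
managing_partners = [
    {"first_name": "Ada", "last_name": "Example", "title": "Managing Partner", "email": "ada@example.com"},
    {"first_name": "Ben", "last_name": "Sample", "title": "Managing Partner", "email": "ben@example.com"},
]

rows = expand_rows(static_columns, managing_partners)
for row in rows:
    print(row)
```

Each printed row holds one partner's details, and both rows share the same `description` value.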

There are also many templates available for you to copy and edit. You can find them in the templates tab, or in our templates library.
You can immediately enter an input and run the task right in your account. This test page is representative of one row of data and is useful for quickly testing formatting and instruction edits. When you are satisfied and don't need further edits, you can upload your spreadsheet with a column that has the relevant inputs (whatever the task requires). Results are cached per version and input, but you can always select the three dots to the right of "Edit Task" and select "Clear Cache" to clear the cache for a specific version.
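As a mental model only, "cached per version and input" can be pictured as a lookup keyed by the pair (task version, input). The toy class below is a sketch under that assumption. It is not PromptLoop's implementation, and every name in it is hypothetical.

```python
from typing import Optional


class ResultCache:
    """Toy cache keyed by (task version, input), for illustration only."""

    def __init__(self) -> None:
        self._store: dict[tuple[int, str], dict[str, str]] = {}

    def get(self, version: int, task_input: str) -> Optional[dict[str, str]]:
        return self._store.get((version, task_input))

    def put(self, version: int, task_input: str, row: dict[str, str]) -> None:
        self._store[(version, task_input)] = row

    def clear_version(self, version: int) -> int:
        """Drop every cached row for one version and return how many were removed."""
        stale = [key for key in self._store if key[0] == version]
        for key in stale:
            del self._store[key]
        return len(stale)


cache = ResultCache()
cache.put(1, "https://example.com", {"description": "v1 result"})
cache.put(2, "https://example.com", {"description": "v2 result"})
removed = cache.clear_version(1)
print(removed, cache.get(1, "https://example.com"), cache.get(2, "https://example.com"))
```

In this model, clearing version 1 leaves the version 2 result for the same input untouched, which matches the idea that the cache is cleared for a specific version.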

To run your task on a dataset, you should follow the guide here. Once your dataset is run on the web task, you will be able to view all results right in the datasets tab and search, filter and save versions.

When you go to the edit page of a task, you will see the following options:
There are three different depths for the crawl tasks: Single page, Smart Crawl, and Deep Research Smart Crawl. This determines how far into the website our models will navigate, looking for answers to your search queries.

Which should I use? For information that is located deep in a website, Deep Research is the best option, but if the information you are looking for is always on the homepage, then the basic Smart Crawl will be sufficient and run faster. We encourage customers to start with Smart Crawl and, if you notice that some items are not getting retrieved, edit the task to use Deep Research. It takes no time to toggle between the two. If you only want information from a specific page, then Single Page is the best option for you.
These are the output columns for each input row of data. For example, if the input is a law firm website, the results will be a row of data with the law firm's description, number, and whether or not they offer personal injury services. You can add, remove or adjust each of these output columns.

Formatting allows you to collect data in exactly the output format that you need. This includes pulling out website links, raw text, or numbers. You should also add additional instructions on top of formatting for more specific preferences.

Categories are a useful formatting type for ensuring uniform output. When categories are set for a search item, the output data for that column will always be one of the options or 'Not Found'. By ensuring output uniformity across the entire dataset, you can more easily analyze, sort, and compare the data.
When you select categories, you will have to add at least one category (you can add as many as you need as well as detailed instructions for how the models should categorize the output).
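The guarantee described above (every value is one of your options or 'Not Found') can be illustrated with a short Python sketch. This shows only the shape of the output constraint, not how PromptLoop's models decide on a category. The function name and the sample categories are assumptions.

```python
NOT_FOUND = "Not Found"


def to_category(raw: str, categories: list[str]) -> str:
    """Map a raw value to one of the allowed categories, else 'Not Found'."""
    cleaned = raw.strip().casefold()
    for category in categories:
        if cleaned == category.casefold():
            return category
    return NOT_FOUND


allowed = ["B2B", "B2C", "Both"]  # hypothetical categories
values = [to_category(v, allowed) for v in ["b2b", " Both ", "unknown"]]
print(values)  # ['B2B', 'Both', 'Not Found']
```

Because the column can only contain a closed set of values, grouping or filtering the finished dataset needs no extra cleanup.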

Sometimes you may want to extract a list of items from a website. The list output lets you specify what data you want for each item, as well as instructions about which items to include or exclude. As with categories, you have to specify at least one list column.
Common use cases for this include:

If you want to generate a list from across the internet, you should use dataset generation instead. List extraction is designed to pull a list of items from a single website.
Managed Tasks are first-party, versioned tasks maintained by PromptLoop. They let you add new, structured columns to a dataset with zero setup—ideal for fast, consistent enrichment and AI generation at scale. PromptLoop Tasks are “no-code AI agents,” so you can run them without prompt tuning.
These include Email and Phone lookup tools built into the platform. If you are able to uncover or upload names and company domains (Peter, Mangan, Promptloop.com), you can use our built-in enrichment and data provider waterfall to add phone and email in one click. For this specific managed task, you only pay for successful enrichments. It includes industry-leading accuracy and pricing.
Waterfall enrichment with 20+ data sources, including:
Credits vary for all managed tasks and are visible at run time.
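The general idea of a waterfall is to try data sources in order and stop at the first one that returns a result. The Python below is a generic sketch of that pattern, not PromptLoop's provider logic. The provider functions are hypothetical stand-ins, the inputs are placeholders, and the email pattern is made up.

```python
from typing import Callable, Optional

Provider = Callable[[str, str, str], Optional[str]]


def waterfall_lookup(
    first: str, last: str, domain: str, providers: list[Provider]
) -> tuple[Optional[str], int]:
    """Try providers in order; return the first hit and how many were tried."""
    for attempts, provider in enumerate(providers, start=1):
        result = provider(first, last, domain)
        if result:
            return result, attempts
    return None, len(providers)


# Hypothetical stand-in providers; real data sources are not modeled here.
def provider_a(first: str, last: str, domain: str) -> Optional[str]:
    return None  # simulates a miss


def provider_b(first: str, last: str, domain: str) -> Optional[str]:
    return f"{first}.{last}@{domain}".lower()  # simulates a hit with a made-up pattern


email, tried = waterfall_lookup("Jane", "Doe", "example.com", [provider_a, provider_b])
billable = email is not None  # mirrors "you only pay for successful enrichments"
print(email, tried, billable)
```

In this run the first stand-in misses and the second returns a value, so the lookup stops after two attempts and is marked billable.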
Steps:

You can also use web search engines instead of websites as a starting point for a task. This opens up a variety of powerful options. There are some nuances that can yield better results when selecting one or more links from a search to return.
Common Examples
Query Options
You can use standard search engine techniques within PromptLoop tasks. Here are a few that can work well.
site:name-of-site.com - This will ensure results are ONLY from this specific site.
-site:name-of-site.com - This will EXCLUDE the site you specify.
apples AND oranges - This will include only results that match BOTH search terms, which narrows the results.
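If you build search terms programmatically before uploading them as inputs, a small helper can combine these operators. The function below is a sketch. Its name and parameters are hypothetical, and it only assembles the query string using the operators listed above.

```python
def build_query(
    terms: list[str], only_site: str = "", exclude_sites: tuple[str, ...] = ()
) -> str:
    """Compose a search query from terms plus site: and -site: operators."""
    parts = [" AND ".join(terms)] if terms else []
    if only_site:
        parts.append(f"site:{only_site}")
    parts.extend(f"-site:{site}" for site in exclude_sites)
    return " ".join(parts)


print(build_query(["apples", "oranges"], only_site="name-of-site.com"))
# apples AND oranges site:name-of-site.com
print(build_query(["apples"], exclude_sites=("name-of-site.com",)))
# apples -site:name-of-site.com
```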
For help setting up advanced search tasks, get in touch for options to customize and optimize your results.
Our web browsing technology is engineered for speed. It allows you to navigate through multiple web pages in a fraction of the time it would take using traditional methods. This means you can quickly gather the data you need, making the most of your time and resources.
We run optimized resources to find the information you need directly from relevant sources. This allows for up-to-date, proprietary data delivered precisely in your required format. Because we leverage language models to navigate pages and identify relevant information, we can handle thousands of formats from millions of company page types, all at speeds unmatched by alternatives, including human-led research.
Unlike generative AI chat applications, PromptLoop uses models and techniques tuned to deliver accurate and formatted results only from trusted and provided sources. Without a source, we will not generate text for information, providing repeatable and reliable answers.
Whether you're looking to scrape data from a handful of web pages or perform Excel web scraping on a massive scale, our service is designed to scale with your needs. Our robust infrastructure ensures stability and performance, even when your data requirements grow exponentially.
We allow you to upload entire datasets to enrich using a CSV file or Excel file using the Datasets tool. This allows non-technical teams to create tasks to find precise information from company or entity websites and return formatted web scraping results for a bespoke dataset.
Our web browsing feature is not just about accessing the web—it's about retrieving data with precision. With it, you can specify the data format you need, whether it's Booleans, direct answers to questions, or other specific data types that standard datasets may not provide.
You can learn more about creating and customizing web research and enrichment tasks with this guide to creating custom tasks.
A cutting-edge web scraping AI is at the core of our web browsing capability. It navigates the web intelligently, understanding the context and semantics of the content to deliver relevant and accurate results.
We take web scraping seriously, adhering to the best practices and compliance standards. Our web browsing capability is optimized for you, allowing you to focus on the information and sources essential to driving your business decisions, not on setting up the specifics of a research pipeline.
For companies requiring a higher level of data security than the standard we offer to all customers, we can customize our infrastructure to meet your team's requirements. You can learn more by scheduling a call with our team today.
Designing an effective task depends on your end goal. This includes your tolerance for errors, the importance of formatting, and how much your input sites vary.
Our Custom Tasks utilize the full potential of our web browsing feature to cater to a wide range of business applications, such as:
With our web browsing capability, Custom Tasks can transform the web into a treasure trove of actionable data for your business.
Experience our web browsing feature's unparalleled efficiency and precision within Custom Tasks. Start harnessing the power of structured data extraction to make informed decisions and drive your business forward.
Ready to unlock the full potential of the web for your business needs? Request a demo here and see our web browsing capability.