PromptLoopGuides
    ⌘K
    PromptLoop

    Getting Started

    Getting Started

    Platform

    Tasks

    Datasets

    Capabilities

    Web Browsing

    Scraping Lists

    Dynamic Data Extraction

    Signals

    Accuracy & Confidence

    Entity Research

    Integrations

    Integrations

    Industry Guides

    Sales Plays

    GTM and Sales

    Capturing Efficiency

    Private Equity

    E-commerce

    Help & More

    Account Limits

    Help & More

    Contact

    Spreadsheet Apps

    On this page

    • Datasets - Running AI tas...
    • Getting Started
    • How to run an AI task on ...
    • Step 1: Upload a file wit...
    • Step 2: Launch the Job
    • Step 3: View results
    • Dataset Tools
    • Generating a Dataset
    • How to generate a dataset
    • New Column Gen
    # Datasets - Running AI tasks on files Promptloop streamlines the process of applying AI models to your data. We do this with datasets. Upload a CSV or Excel file, select relevant inputs for the model to use, and launch AI tasks on hundreds to thousands of rows. Datasets are where you will store, edit, and view data that you are processing with AI tasks. **Helpful Links** - [Datasets Page](/account/datasets) - [Generate a Dataset](/docs/autoloop#generating-a-dataset) - [Create a task](/account/custom/new) ## Getting Started This video overview provides a guide of how to get started and take full advantage of datasets. <VideoLink video_src='https://share.descript.com/view/y6OY0qHqVyk' image_src='https://web-public-photos.s3.amazonaws.com/datasetview.jpg'/> **Files and Versions** Datasets are where you can upload the data you are working with on the PromptLoop Platform. **Uploading** You can upload data to launch a job, using an existing AI task, or upload a dataset to filter and analyze first before running. **Generating Datasets** If you don't have any data to start, you can use our crawlers to pull back thousands of companies that meet your criteria. This is a great place to start if **Run an AI task on each row** When you run an AI task on a dataset, the job will run in the background and save as a new version in the dataset when its complete. This will allow you to run tens of thousands of operations at once and freeing you up for other work. <Callout title="Limits"> If you have questions about limits, or have a task that requires a large amount of rows, reach out to the team [here](/demo). </Callout> **Reliable and Scalable** Datasets are an extremely powerful feature available to all users and one of the core capabilities of how we think about leveraging AI models efficiently. ## How to run an AI task on a spreadsheet file upload ### Step 1: Upload a file with data Datasets let you use any data table in an Excel or CSV file. Our systems automatically detect columns and let you select which you want to use. - You can select the columns that you will use as inputs for the task - this is often a single column like a website or search term - Results from your task - new columns or rows - will be added into the uploaded sheet and available to download as a new file ### Step 2: Launch the Job Select your task and launch the job with the correct input columns. Jobs are immediately added to the queue based on your account tier and capacity. You will see progress and results once the task is running. Even extremely large jobs usually complete within 90 minutes. If you do not already have an AI task created, use the editor to create one or copy a template to edit from the template library. The [task creation tool](/account/custom/new) guides will help you get started. ### Step 3: View results Results are saved as a new dataset version for review. You can then search and filter data before exporting and using the results of the task. You can also run data on another task right from the datasets page. <Image src='https://web-public-photos.s3.amazonaws.com/datasetview.jpg' alt='Choose a task' width={800} height={420} /> For help setting up your first datasets, or questions about capacity and running large files, reach out the the team or book time with us to let us help you. [Book a Session](/demo) ## Dataset Tools ### Generating a Dataset <Image src='https://img.promptloop.com/generate_options-695f976e.png' alt='Preview' width={800} height={420} /> <Callout title="Access">This is available for all enterprise teams and it is free to generate as many datasets as needed. </Callout> Dataset generation utilizes our web crawlers to crawl millions of businesses to construct relevant datasets from scratch for you to identify high quality sales targets. This is split into two categories now: Geographic Crawl and Global Crawl. **Geographic Crawl** - This takes in geographic constraints (I.e New York State, Atlanta, or the South East US) as well as a keyword (I.e Auto repair shops, hotels, coffee shops) and returns a deduped list of all businesses in that region that match those keywords. This crawls tens of millions of businesses with physical addresses and is great starting point that you can then run more custom tasks on. **Global Crawl** - This takes in business specific filters (I.e B2B, Healthcare, Headquartered in the US) and uses our crawlers to return a list of thousands of businesses matching that criteria and their websites. We pull all of the information directly from their website, so it is as up - to - date as possible. <Callout title="Limits"> Dataset Generation is limited based on your account tier and set up. If you would like assistance increasing limits please contact your admin or book a call with the team [here](/demo). </Callout> #### How to generate a dataset **Step 1** <Image src='https://img.promptloop.com/generate_dataset_blank_datasets-6ebd566c.png' alt='Preview' width={800} height={420} /> Navigate to the [Datasets]('promptloop.com/account/datasets') page and select **Generate Dataset** **Step 2** <Image src='https://img.promptloop.com/generate_options-695f976e.png' alt='Preview' width={800} height={420} /> Choose your option. **Step 3 - Geographic Crawl** <Image src='https://img.promptloop.com/CleanShot 2025-06-01 at [email protected]' alt='Preview' width={800} height={420} /> Select the region(s), city(s), and or specific zipcodes you want to target and put in a keyword to search for. You should only ever provide one keyword type per generation (e.g don't put: 'hotels, restaurants, and schools') as the crawlers will look for businesses that match all of them. Instead if you need coverage of all three, run three separate generations. We recommend starting with a small subset of the geography to confirm the keyword is returning the correct type of businesses. **Step 4** <Image src='https://img.promptloop.com/Generating_dataset_dataset_generating-850c6428.png' alt='Preview' width={800} height={420} /> Once you click generate, you will see a dataset row show up as the crawler is working. It will automatically update once it is completed and usually takes a couple of minutes. **Step 5** <Image src='https://img.promptloop.com/completed_dataset_generation_dataset-74a24079.png' alt='Preview' width={800} height={420} /> Once generated, the dataset can be used the same as any other and you can run additional enrichment tasks as needed! ### New Column Gen For any dataset on PromptLoop, you can take advantage of the new column generation feature to quickly format, clean up, or edit the output of a task or change any column. <Image src='https://img.promptloop.com/new-col-button.jpg' alt='Preview' width={800} height={420} /> <Callout title="No Credits Needed"> New Column Gen is available for free on all datasets with all team and enterprise plans. You can use this for quick and reliable reformatting right in the PromptLoop platform </Callout> Just like an Excel function, you can select the existing columns that you want to use as input and context and provide instructions for what you want to accomplish. <Image src='https://img.promptloop.com/new-col-launch.jpg' alt='Preview' width={800} height={420} /> When you are creating your prompt you can run a quick preview on random rows. This allows you to dial in the final results and iterate quickly. <Image src='https://img.promptloop.com/new-col-preview.jpg' alt='Preview' width={800} height={420} /> This then runs on all rows in your dataset automatically when you click **Generate**. It runs much faster than a normal task, but allow about 10 minutes per 20k rows of data. The completed file with the new generated column will be automatically added as a new version when it completes.

    Datasets - Running AI tasks on files

    Promptloop streamlines the process of applying AI models to your data. We do this with datasets. Upload a CSV or Excel file, select relevant inputs for the model to use, and launch AI tasks on hundreds to thousands of rows.

    Datasets are where you will store, edit, and view data that you are processing with AI tasks.

    Helpful Links

    • Datasets Page
    • Generate a Dataset
    • Create a task

    Getting Started#

    This video overview provides a guide of how to get started and take full advantage of datasets.

    Watch the video

    Files and Versions Datasets are where you can upload the data you are working with on the PromptLoop Platform.

    Uploading You can upload data to launch a job, using an existing AI task, or upload a dataset to filter and analyze first before running.

    Generating Datasets If you don't have any data to start, you can use our crawlers to pull back thousands of companies that meet your criteria. This is a great place to start if

    Run an AI task on each row When you run an AI task on a dataset, the job will run in the background and save as a new version in the dataset when its complete. This will allow you to run tens of thousands of operations at once and freeing you up for other work.

    Limits

    If you have questions about limits, or have a task that requires a large amount of rows, reach out to the team here.

    Reliable and Scalable Datasets are an extremely powerful feature available to all users and one of the core capabilities of how we think about leveraging AI models efficiently.

    How to run an AI task on a spreadsheet file upload#

    Step 1: Upload a file with data#

    Datasets let you use any data table in an Excel or CSV file. Our systems automatically detect columns and let you select which you want to use.

    • You can select the columns that you will use as inputs for the task - this is often a single column like a website or search term
    • Results from your task - new columns or rows - will be added into the uploaded sheet and available to download as a new file

    Step 2: Launch the Job#

    Select your task and launch the job with the correct input columns. Jobs are immediately added to the queue based on your account tier and capacity. You will see progress and results once the task is running. Even extremely large jobs usually complete within 90 minutes.

    If you do not already have an AI task created, use the editor to create one or copy a template to edit from the template library. The task creation tool guides will help you get started.

    Step 3: View results#

    Results are saved as a new dataset version for review. You can then search and filter data before exporting and using the results of the task. You can also run data on another task right from the datasets page.

    Choose a task

    For help setting up your first datasets, or questions about capacity and running large files, reach out the the team or book time with us to let us help you. Book a Session

    Dataset Tools#

    Generating a Dataset#

    Preview
    Access
    This is available for all enterprise teams and it is free to generate as many datasets as needed.

    Dataset generation utilizes our web crawlers to crawl millions of businesses to construct relevant datasets from scratch for you to identify high quality sales targets. This is split into two categories now: Geographic Crawl and Global Crawl.

    Geographic Crawl - This takes in geographic constraints (I.e New York State, Atlanta, or the South East US) as well as a keyword (I.e Auto repair shops, hotels, coffee shops) and returns a deduped list of all businesses in that region that match those keywords. This crawls tens of millions of businesses with physical addresses and is great starting point that you can then run more custom tasks on.

    Global Crawl - This takes in business specific filters (I.e B2B, Healthcare, Headquartered in the US) and uses our crawlers to return a list of thousands of businesses matching that criteria and their websites. We pull all of the information directly from their website, so it is as up - to - date as possible.

    Limits

    Dataset Generation is limited based on your account tier and set up. If you would like assistance increasing limits please contact your admin or book a call with the team here.

    How to generate a dataset#

    Step 1

    Preview

    Navigate to the Datasets page and select Generate Dataset

    Step 2

    Preview

    Choose your option.

    Step 3 - Geographic Crawl

    Preview

    Select the region(s), city(s), and or specific zipcodes you want to target and put in a keyword to search for. You should only ever provide one keyword type per generation (e.g don't put: 'hotels, restaurants, and schools') as the crawlers will look for businesses that match all of them. Instead if you need coverage of all three, run three separate generations. We recommend starting with a small subset of the geography to confirm the keyword is returning the correct type of businesses.

    Step 4

    Preview

    Once you click generate, you will see a dataset row show up as the crawler is working. It will automatically update once it is completed and usually takes a couple of minutes.

    Step 5

    Preview

    Once generated, the dataset can be used the same as any other and you can run additional enrichment tasks as needed!

    New Column Gen#

    For any dataset on PromptLoop, you can take advantage of the new column generation feature to quickly format, clean up, or edit the output of a task or change any column.

    Preview
    No Credits Needed

    New Column Gen is available for free on all datasets with all team and enterprise plans. You can use this for quick and reliable reformatting right in the PromptLoop platform

    Just like an Excel function, you can select the existing columns that you want to use as input and context and provide instructions for what you want to accomplish.

    Preview

    When you are creating your prompt you can run a quick preview on random rows. This allows you to dial in the final results and iterate quickly.

    Preview

    This then runs on all rows in your dataset automatically when you click Generate. It runs much faster than a normal task, but allow about 10 minutes per 20k rows of data. The completed file with the new generated column will be automatically added as a new version when it completes.