Quick guide to creating datasets using ChatGPT
This article was co-authored by wikiHow staff writer, Kyle Smith. Kyle Smith is a wikiHow Technology Writer, learning and sharing information about the latest technology. He has presented his research at multiple engineering conferences and is the writer and editor of hundreds of online electronics repair guides. Kyle received a BS in Industrial Engineering from Cal Poly, San Luis Obispo.
This article has been viewed 1,735 times.
Learn more...
Trying to create an example dataset in ChatGPT? The text AI software can quickly generate a table of example information for any subject! This is great for getting ideas for your own data collection, or for practicing data analysis. Note that the ChatGPT model is limited to creating data based on its training, meaning that the dataset may not be accurate or represent the real world. This wikiHow guide will show you how to use ChatGPT to create a dataset.
Things You Should Know
- Type your request for a dataset by including the subject and information you're looking for.
- You can add specifics like length of dataset and variables to include.
- The dataset will typically output as a table.
- Add "Format the dataset as a csv" to have ChatGPT output the dataset in the csv format.
Steps
Creating a Dataset with ChatGPT
-
1Go to https://chat.openai.com/auth/login and log in. This is the official website for ChatGPT. If you don't already have one, you'll need to create an OpenAI account to access ChatGPT.
- Note that ChatGPT has an approximate word limit, so it can only generate small datasets.
- If ChatGPT is at capacity, you'll need to come back at a less busy time.
-
2Type in a request for a dataset. In the textbox at the bottom of ChatGPT, enter your request for a dataset. The request will typically output as a table. Here are a few examples:
- "Create an example dataset of customer orders from a kitchen supplies company."
- "Create an example dataset with 10 entries of California cities population information."
- "Create an example dataset showing geological samples of different locations."
-
3Modify the request. You can add more specific information that you want included in the example dataset. For example, you could add the specific variables you want in the set and how long you want the set to be. Here's an example:
- "Create an example dataset of customer orders from a kitchen supplies company. Please include the price and quantity of each order. Also include the customer's state location. Make the dataset 5 entries long."
-
4Change the dataset to a csv format. If you need to copy the dataset as a csv, you can request that ChatGPT formats it as a csv. The dataset will typically appear as a code snippet.
- Here's an example: "Create an example dataset of customer orders from a kitchen supplies company. Format it as a csv."
- You can click Copy code to quickly copy the entire csv dataset.
-
5Request information about how to analyze the data. ChatGPT can also provide tutorials on how to analyze datasets (although the accuracy of its information can vary). Even if the code it provides isn't entirely correct, it can be a good place to start!
- For example, you could submit: "Create an example dataset of customer orders from a kitchen supplies company. Format it as a csv."
- Then, in a follow-up request, submit: "How would I analyze the dataset in Python?"
- You can replace Python with whichever software you're using, such as R, SAS, and Microsoft Excel.
- If you encounter an error when running the code it provides, you can submit a follow-up asking ChatGPT to fix the issue: "When I ran the above code, I got the error [error text]. How can I change the code to fix it?"