ChatGPT Prompts for Data Analysts

Revolutionize your data analysis workflow with ChatGPT prompts for data analysts and
get back to what matters most: making informed decisions and driving business growth.

Prompts for Data Analyst | Prompt Details
Generate Data | I want you to act as a fake data generator. I need a dataset that has x rows and y columns: [insert column names]. Can you generate fake data for me?
Train Regression Model | I want you to act as a data scientist and code for me. I have a dataset of [describe dataset]. Please build a machine learning model that predicts [target variable] using a regression algorithm such as linear regression, random forest regression, etc.
Train Clustering Model | I want you to act as a data scientist and code for me. I have a dataset of [describe dataset]. Please build a machine learning model that groups the data into n clusters based on similarity using a clustering algorithm such as k-means, hierarchical clustering, etc.
Train Neural Network Model | I want you to act as a data scientist and code for me. I have a dataset of [describe dataset]. Please build a neural network model that predicts [target variable] using a deep learning framework such as TensorFlow, Keras, PyTorch, etc.
Merge Data | I want you to act as a data analyst. I have two datasets [describe datasets]. Please write Python code to merge these datasets by joining them on a common column.
Reshape Data | I want you to act as a data analyst. I have a dataset of [describe dataset]. Please write Python code to reshape the data from wide to long format or vice versa.
Group Data | I want you to act as a data analyst. I have a dataset of [describe dataset]. Please write Python code to group the data by one or more columns and calculate summary statistics such as count, mean, median, etc.
Filter Data | I want you to act as a data analyst. I have a dataset of [describe dataset]. Please write Python code to filter the data based on certain criteria, such as a range of values or a specific category.
Calculate Moving Average | I want you to act as a data analyst. I have a time series dataset [describe dataset]. Please write Python code to calculate a moving average of the target variable over a window of n days.
Create Lagged Variables | I want you to act as a data analyst. I have a time series dataset [describe dataset]. Please write Python code to create lagged variables of the target variable for n periods.
Calculate Percentage Change | I want you to act as a data analyst. I have a time series dataset [describe dataset]. Please write Python code to calculate the percentage change of the target variable over a window of n days.
Normalize Data | I want you to act as a data analyst. I have a dataset of [describe dataset]. Please write Python code to normalize the data by scaling each feature to have zero mean and unit variance.
Act as an Excel Sheet | I want you to act as a text-based Excel. You'll only reply with a text-based, 10-row Excel sheet with row numbers and cell letters as columns (A to L). The first column header should be empty to reference the row number. I will tell you what to write into cells and you'll reply only with the result of the Excel table as text, and nothing else. Do not write explanations. I will write formulas and you'll execute them and reply only with the result of the Excel table as text. First, reply with the empty sheet.
Act as a Statistician | I want you to act as a statistician. I will provide you with details related to statistics. You should have knowledge of statistics terminology, statistical distributions, confidence intervals, probability, hypothesis testing and statistical charts. My first request is "I need help calculating how many million banknotes are in active use in the world".
Act as a Scientific Data Visualizer | I want you to act as a scientific data visualizer. You will apply your knowledge of data science principles and visualization techniques to create compelling visuals that help convey complex information, develop effective graphs and maps for conveying trends over time or across geographies, utilize tools such as Tableau and R to design meaningful interactive dashboards, and collaborate with subject matter experts in order to understand key needs and deliver on their requirements. My first request is "I need help creating impactful charts from atmospheric CO2 levels collected from research cruises around the world."
Act as a Fill in the Blank Worksheets Generator | I want you to act as a fill in the blank worksheets generator for students learning English as a second language. Your task is to create worksheets with a list of sentences, each with a blank space where a word is missing. The student's task is to fill in the blank with the correct word from a provided list of options. The sentences should be grammatically correct and appropriate for students at an intermediate level of English proficiency. Your worksheets should not include any explanations or additional instructions, just the list of sentences and word options. To get started, please provide me with a list of words and a sentence containing a blank space where one of the words should be inserted.
Write Python Code to Find the Best Classification Model | I want you to act as an automatic machine learning (AutoML) bot using TPOT for me. I am working on a model that predicts [...]. Please write Python code to find the best classification model with the highest AUC score on the test set.
Need a Dataset with X Rows and Y Columns | I need a dataset that has x rows and y columns: [insert column names].
The Most Important KPIs for the Field | What are the most important KPIs for [insert industry/field]?
Provide Mathematical Formulas for KPIs | Can you provide me with the mathematical formulas for the most important KPIs for [insert industry/field]?
Give 4 Formulas in SQL Code | Can you give the 4 formulas for [metrics] in SQL code?
Generate an Example of a Transactions Dataset | Generate an example of a transactions dataset that [company] can create.
Write Code for Data Visualization and Exploration | I want you to act as a data scientist and code for me. I have a dataset of [describe dataset]. Please write code for data visualization and exploration.
Oversample and Undersample Data | I want you to act as a coder. I have trained a machine learning model on an imbalanced dataset. The predictor variable is the column [insert column name]. In Python, how do I oversample and/or undersample my data?
Explain the Model's Results | I want you to act as a data scientist and explain the model's results. I have trained a decision tree model and I would like to find the most important features. Please write the code.
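
Several of the time series and normalization prompts in the table ask for small, well-defined computations, so it helps to know what a correct answer looks like before reviewing generated code. The sketch below is one possible reference, not the output ChatGPT would necessarily produce: it uses only the Python standard library, and the function names and the `sales` series are hypothetical. A generated answer would more likely rely on pandas (`rolling`, `shift`, `pct_change`), which should give equivalent numbers apart from missing-value markers and the fact that `pct_change` reports fractions rather than percentages.

```python
from statistics import mean, pstdev


def moving_average(values, window):
    """Trailing moving average; positions without a full window are None."""
    if window <= 0:
        raise ValueError("window must be positive")
    out = []
    for i in range(len(values)):
        if i + 1 < window:
            out.append(None)  # not enough history yet
        else:
            out.append(mean(values[i + 1 - window : i + 1]))
    return out


def lag(values, periods):
    """Shift the series later by `periods` steps, padding the start with None."""
    if periods < 0:
        raise ValueError("periods must be non-negative")
    padded = [None] * periods + list(values)
    return padded[: len(values)]


def pct_change(values, periods=1):
    """Percentage change versus the value `periods` steps earlier (in percent)."""
    if periods <= 0:
        raise ValueError("periods must be positive")
    out = []
    for i, current in enumerate(values):
        if i < periods or values[i - periods] == 0:
            out.append(None)  # no earlier value, or division by zero
        else:
            previous = values[i - periods]
            out.append((current - previous) / previous * 100)
    return out


def standardize(values):
    """Scale to zero mean and unit variance (population standard deviation)."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]  # constant series: nothing to scale
    return [(v - mu) / sigma for v in values]


if __name__ == "__main__":
    sales = [10, 12, 14, 16, 18]  # hypothetical daily values
    print(moving_average(sales, 3))
    print(lag(sales, 1))
    print(pct_change(sales, 1))
    print(standardize(sales))
```

Running generated code and a reference like this on the same small series is a quick way to spot off-by-one windows, a wrong shift direction, or a sample-versus-population standard deviation mismatch.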

FAQ

Can ChatGPT be used for data analysis?
Absolutely! ChatGPT’s natural language processing capabilities make it well suited to data analysis tasks that involve text or numerical data. You can ask ChatGPT questions about your data and get answers right away.
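Can ChatGPT analyze data in Excel?
Yes, ChatGPT can help analyze Excel data. You can paste the contents of a sheet into the chat or, where file upload is available, upload the workbook, and then ask for insights based on the data it contains. This can save you a lot of time compared to manual data analysis techniques.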
Can ChatGPT create spreadsheets?
Yes, ChatGPT can generate basic spreadsheets with sample data, basic functions, and charts. It can also help automate routine spreadsheet tasks by writing macros and formulas. However, advanced spreadsheets for business intelligence, data modeling, or deep insights still require trained humans to design, validate, and interpret them.
Can ChatGPT replace data analysts?
While ChatGPT can be a useful tool to assist data analysts in some ways, it cannot be fully trusted to replace the work of trained human data professionals. ChatGPT is best used to augment human data analysis, not replace it. Any AI-generated analysis must be rigorously reviewed, verified, and refined by data analysts to ensure it is accurate and useful and does not miss important insights.