Module 05 - Assisting with Analysis Workflows

Generative AI for Data and Analytics

In this module, we focus on streamlining and simplifying the various stages of data analysis workflows. Standard preprocessing steps, often time-consuming, are addressed, including data cleaning, parsing, and creating high-level overviews. We provide resources which offer guidance on breaking down complex tasks into manageable steps, constructing effective workflows and roadmaps, and seamlessly connecting individual steps into a standard preprocessing pipeline. All of the above provide immunologists with practical considerations for enhancing their data analysis processes.


Table of Contents

  1. Generative AI for Data and Analytics
  2. Data Preprocessing Tasks
  3. Guide to Using ChatGPT for Data Science Projects
  4. Translating Code
  5. Other Applications of AI for Data Science
  6. References

Data Preprocessing Tasks

ChatGPT can be a valuable integration into existing data science workflows. Data cleaning and preprocessing are often very time-consuming steps for any big analysis projects. Luckily, a tutorial on KDnuggets outlines how to use ChatGPT to help with a few tasks. Check out the site for step-by-step prompt and code examples.

  • Fetch and load the dataset
  • Check for missing values
  • Encode categorical variables
  • Visualizing the values of categorical variables
  • Visualizing the distribution of class labels
  • Tying it all together into a data cleaning and preprocessing pipeline

Guide to Using ChatGPT for Data Science Projects

If you’re curious how to take it even further in an end-to-end data science project, check out this datacamp tutorial. The following steps are covered with example pictures and code:

  • Project planning
  • Data analysis
  • Data preprocessing
  • Model selection
  • Hyperparameter tuning
  • Developing a web app
  • Deploying it

Translating Code

One user on Twitter/X shared their favorite use fo ChatGPT, which is to translate code from one language to another.

Translate Code 01

Alt Text: (Source: Twitter/X)

Translate Code 01

Alt Text: (Source: Twitter/X)

Other Applications of AI for Data Science

Practical Data Science on AWS: Generative AI Presented by DeepLearning.AI, this is a webinar-based tutorial introducing the practical aspects of using tools like ChatGPT and Stable Diffusion for data science tasks within the AWS cloud.

Excel AI – Data Analysis Made Easy A guide to getting started using the language-based generative AI functionality in Excel to uncover insights in your data.

References


© 2023 Anonymized Authors per NeurIPS Workshop Submission Policies [MIT License]