Overview: R is mostly known for its statistical methods on numbers. However, it is also capable of working efficiently with text and analysing it. In this workshop we will take a deep dive into how to work with text in R and introduce you to functions that allow you to even do text mining.
At the end of this workshop, you will be able to
· manipulate strings efficiently
· import text from multiple sources
· lay the basis for working with regular expressions (regex)
· do some text mining with the tidytext package
Coding Ateliers
As part of CDSI, the Data Science Solutions Hub (DaS2H) offers Coding Ateliers to the entire McGill community, where you can drop-in during atelier hours with your laptop, work on your research project and, whenever you have a question, we would be happy to assist you on the spot. If your question turns out to be a bigger issue that we cannot solve instantly, we will advise you how to best proceed.
Overview: In this workshop, students will learn:
- The different types of plots depending on the characteristic of their data/variables
- How to plot by group
- How to assign different aesthetics (size/color/shape/etc)
- How to refine their plot (title/axis labels/legends)
- How to save their plot in their local machine
Prerequisites:
- Introductory knowledge of Python
Coding Ateliers
As part of CDSI, the Data Science Solutions Hub (DaS2H) offers Coding Ateliers to the entire McGill community, where you can drop-in during atelier hours with your laptop, work on your research project and, whenever you have a question, we would be happy to assist you on the spot. If your question turns out to be a bigger issue that we cannot solve instantly, we will advise you how to best proceed.
Overview: Many Generative Artificial Intelligence (GenAI) models now offer support in programming with different languages. In this workshop we will explore different models like OpenAI's ChatGPT, Anthropic's Claude and Microsoft's CoPilot in programming with R. We will discuss strengths and weaknesses, when and how to use GenAI using R.
As part of the data science life cycle series in R, this workshop will introduce you to creating a dynamic website to display your analytical results and data sets. The R shiny package allows you to create nice-looking interactive dashboards.
At the end of this workshop, you will be able to
- understand the logic of building shiny apps
- build a basic dashboard
- create input fields and manipulate the output of the dashboard
Coding Ateliers
As part of CDSI, the Data Science Solutions Hub (DaS2H) offers Coding Ateliers to the entire McGill community, where you can drop-in during atelier hours with your laptop, work on your research project and, whenever you have a question, we would be happy to assist you on the spot. If your question turns out to be a bigger issue that we cannot solve instantly, we will advise you how to best proceed.
Overview:
In this workshop, we will delve into the art of conducting Exploratory Data Analysis (EDA) on a given dataset. EDA encompasses a broad spectrum of critical data analysis components, which include, but are not restricted to, the following:
Machine Learning in Python - Neural networks, data leakage, the train/test split
Coding Ateliers
As part of CDSI, the Data Science Solutions Hub (DaS2H) offers Coding Ateliers to the entire McGill community, where you can drop-in during atelier hours with your laptop, work on your research project and, whenever you have a question, we would be happy to assist you on the spot. If your question turns out to be a bigger issue that we cannot solve instantly, we will advise you how to best proceed.
The internet is full of data that you can use for your analysis. As part of our data life cycle series, we will introduce you to efficiently retrieve data from the internet. We will look into downloading files, using APIs, web-scraping, and downloading feeds.
At the end of this workshop, you will be able to
- use an API
- understand ethical web-scraping (i.e. how to download data from a website)
- understand how to work with RSS feeds
Coding Ateliers
As part of CDSI, the Data Science Solutions Hub (DaS2H) offers Coding Ateliers to the entire McGill community, where you can drop-in during atelier hours with your laptop, work on your research project and, whenever you have a question, we would be happy to assist you on the spot. If your question turns out to be a bigger issue that we cannot solve instantly, we will advise you how to best proceed.
Overview: Data is stored less and less in files as the amount of data grows more and more these days. Databases, however, can handle large and complex data sets well – either on your computer or in the cloud. Databases include powerful tools to store, search and analyze data sets. In this workshop you will learn how to use one of the most used databases: PostGreSQL.
Coding Ateliers
As part of CDSI, the Data Science Solutions Hub (DaS2H) offers Coding Ateliers to the entire McGill community, where you can drop-in during atelier hours with your laptop, work on your research project and, whenever you have a question, we would be happy to assist you on the spot. If your question turns out to be a bigger issue that we cannot solve instantly, we will advise you how to best proceed.
Coding Ateliers
As part of CDSI, the Data Science Solutions Hub (DaS2H) offers Coding Ateliers to the entire McGill community, where you can drop-in during atelier hours with your laptop, work on your research project and, whenever you have a question, we would be happy to assist you on the spot. If your question turns out to be a bigger issue that we cannot solve instantly, we will advise you how to best proceed.