Skip to content Skip to site navigation Skip to service navigation

Practical Techniques for Analyzing Data With Python

New session times will be displayed below upon confirmation.

Effective immediately in response to COVID-19, most Technology Training classes will be delivered online until further notice.

In advance of each session, Tech Training will provide you with a Zoom link to your class, along with any required class materials.

 


Almost every aspect of our lives is now data-driven, and data analysis has emerged as one of the most in-demand professional skills. Experts estimate that millions of jobs in data science might remain vacant for the lack of available talent. The global demand for skilled data scientists is not limited to demand for statisticians or computer scientists, but representative of an ever more pervasive need to understand data in every part of an organization. Well-rounded analysts with domain expertise are critical to performing accurate analysis of big data and making effective decisions with the information gleaned from the analysis.

 

This requires some experience in software programming, and statistical analysis, as well as exceptional communication skills.  This course leverages Python to provide users with a simple yet powerful tool for processing, organizing, examining, structuring, analyzing, visualizing, and consuming data.

 

Prerequisites: 

  • A Basic Understanding of Python, successful completion of required quiz
  • Ability to install python packages on your computer and set up the Jupyter programming environment

We highly recommend that you complete the quiz hosted on Gradescope, to test your readiness for this workshop.  If you score 70% or greater, you are ready to join us. 

Go to Gradescope: https://www.gradescope.com/

  • Click Sign Up
  • Use Course Entry Code: RZRR6V 
  • Select Stanford University

Otherwise, we highly recommend that you first complete the Python I offered by the Stanford Technology Training program and join us at a future session. 

 

Course participants will have an opportunity to learn:

  • How to work with 15 different Data formats
  • How to use four different Powerhouse Python Data Analysis Modules:
    • Pandas
    • SciPy
    • NumPy
    • Matplotlib
  • How to identify sources of useful data 
  • How to identify the advantages and disadvantages of different data storage types
  • How to develop practical skills for:
    • Working with publicly available datasets
    • Making your own datasets
    • Cleaning Data Sets
  • Simple Ways to Perform Basic Statistical Analysis on Datasets
  • Ways to practice their skills through in class exercises and activities

 

Participants will also receive a copy of the book: Python for Data Analysis: Data Wrangling with pandas, NumPy, and Jupyter 3rd Edition, by Wes Mckinney 

4 Modules -- 3 hours each (over two weeks)

Adam Keppler

Adam graduated Summa Cum Laude with his Bachelor of Science in Computer Science (CS) from UCI specializing in Computer Architecture (2017), and subsequently continued on to further his studies in the field with a Master in CS (2020) from the Stanford School of Engineering.

Custom training workshops are available for this program

Technology training sessions structured around individual or group learning objectives. Learn more about custom training


University IT Technology Training sessions are available to a wide range of participants, including Stanford University staff, faculty, students, and employees of Stanford Hospitals & Clinics, such as Stanford Health Care, Stanford Health Care Tri-Valley, Stanford Medicine Partners, and Stanford Medicine Children's Health.

Additionally, some of these programs are open to interested individuals not affiliated with Stanford, allowing for broader community engagement and learning opportunities.