CSCAR will offer a series of workshops on data science skills using Python. The workshops will be held in the Earl Lewis room in the Rackham building. All workshops will take place on Wednesday afternoons from 3:30-5.
The workshops are free and no registration is necessary.
We assume that participants are already familiar with basic Python. People with no experience using Python but who are comfortable using languages such as R or Matlab should also be able to follow the presentations.
Feel free to bring a laptop, but it is not required.
January 27: Data management with Pandas
February 10: Graphics and data visualization with Matplotlib and Bokeh
February 17: Pandas case studies (CMS data analyses):
February 24: Basic statistical analysis with Statsmodels
The released Statsmodels package is very old. If you want to get the latest (development) version of Statsmodels, the following should work:
pip install git+git://github.com/statsmodels/statsmodels@master
The documentation for the development version of Statsmodels is here:
http://www.statsmodels.org/dev/index.html
March 8: FLUX Geospatial analysis and MPI (R version) -- note special day, starts at 2pm
March 9: FLUX Geospatial analysis and MPI (Python version) -- note special time, starts at 2pm
March 16: SQLite: A small and lightweight database
March 23: Advanced regression analysis (GEE, mixed models and multiple imputation) with Statsmodels
April 6: Survival analysis with Statsmodels
April 20: Sklearn for predictive analysis and data exploration
Additional workshops will be scheduled on the following topics, dates to be announced:
Geospatial analysis
Building and accessing databases
MPI, parallel, and distributed computing