For my final project, I’m going to explain basic SQL query syntax, including SELECT, FROM, and WHERE clauses. I will assume that they are familiar with terms for describing data in spreadsheet form, and if they understand a little basic set algebra (union and intersection) I’ll explain joins. If there is time, I can get into ORDER BY and GROUP BY.

I’m going to prepare material for all of these topics, and order it in the order above. If my audience (co-workers without SQL experience) is interested, I’ll push through until it’s all done. Otherwise I’ll stop after SELECT/FROM/WHERE.

I will post my concept map, assessment questions, and course materials below, but they will also be in this git repository.

UPDATE: Of course I’ve realized that I’m attempting way too much information for 8-10 minutes. I still like my assessment questions and feel like I can cover that material in 8-10 minutes, but my original concept map is too expansive. It was a brain dump instead of a more careful list of things I could realistically cover.

My concept map is below:

Assessment questions:

You have collected experimental data and stored it in a database with the
following rows:
 sample_id, experiment_date, mass 1001, 20130730T063000, 17.62 1002, 20130730T063000, 16.40 1003, 20130730T063000, 15.93 1004, 20130730T063000, 15.96 1005, 20130730T080000, 17.34 1006, 20130730T080000, 17.01 1007, 20130805T133000, 17.07 1008, 20130805T133000, 17.25 1009, 20130805T133000, 17.16 1010, 20130805T133000, 16.23 1011, 20130805T133000, 16.06 1012, 20130805T133000, 16.89 1013, 20130810T091500, 17.29 1014, 20130810T091500, 17.02 1015, 20130810T091500, 16.48 1016, 20130810T091500, 16.67 1017, 20130810T091500, 17.22 1018, 20130810T091500, 17.51 1019, 20130830T140000, 17.28 1020, 20130830T140000, 17.97

What query would you use to find the sample_id for all samples obtained before August 10th with a mass of at least 17?
What query would you use to find the average mass of all samples obtained on July 30th?

Course Materials:

iPython notebooks are in this git repository. I created SQLite databases and connected to them using the ipython-sql module. While this complicated matters by requiring %sql magic to be entered before all commands, it allowed me to use an iPython notebook as a presentation.

Feedback:

This was a little too basic for my target audience. They would have benefited more from explaining the various JOIN clauses.
Using ipython-sql for people who aren’t particularly familiar with ipython may be a little confusing. It probably would have been better to use something like SQLite Manager for Firefox.
Assessment questions were well matched to the material.
It probably would have been better to be more thematically consistent with the sample/assessment datasets.