Lies, Damned Lies, & Data Visualization

Graph 3190-01 | RISD | Spring 2017

Charts and graphs have an indisputable aura of objectivity and yet, much like statistics, they have an immense power to either elucidate or mislead. What makes an information graphic ‘trustworthy’ and how do designers know that their work is telling the ‘whole story’? In this course we will deal with the nuts and bolts of collecting & processing data and explore different ways of communicating its meaning in a quantitatively rigorous and visually engaging way.

Assignments will involve the use of scripting, databases, and other numerical tools to transform data into something that is understood rather than simply ‘seen’. Students are encouraged to consider data sources from their surroundings or the larger world and to break away from the screen-based status quo, eschewing the expected line graphs, pie charts, and tables in favor of unconventional visualizations of their own devising.

Objectives
  • to give students an understanding of the process of acquiring, analyzing, refactoring, and visualizing data
  • to develop an understanding of the building-blocks of visual data representation (bar charts, scatter plots, network diagrams, etc.), know when each is appropriate, and learn to avoid their associated pitfalls
  • to discuss the epistemological issues raised by being an irrational primate attempting to make systematic sense of an unverifiable world
  • to establish Hypothesis Testing as a working method for developing visual explanations and discovering the ‘story’ within a dataset
Prerequisites
  • basic bitmap, vector, and (potentially) video-editing skills
  • familiarity with statistical reasoning (mean, median, sorting, normalization, etc.)
  • facility with a scripting language/data visualization library (d3, SciPy, R) or other data analysis tool (Mathematica, MATLAB, Excel)
  • not required but helpful: knowledge of databases, server-side programming, interaction design, and animation (or audio)
Readings

Readings will be assigned weekly covering both formal and conceptual issues involved in data science. We will discuss the readings in class in relation to the current assignment and each other’s coursework. Each student must submit 3 questions to the class website before 8 a.m. the day of class.

These questions will act as prompts for the in-class discussion, so anything that can be answered with a ‘yes’ or ‘no’ is probably not up to snuff. The questions should not be questions for the instructor but are intended for your fellow students. You must come to class prepared to discuss the texts.

Assignments

There will be 4 assignments over the course of the semester. Though there will be a ‘final’ crit in class at the end of each of these, you may continue working on them throughout the semester. Your final grade will be based on the versions you turn in at the last class meeting on May 15th.

Presentations

Each student will give a pair of presentations on an artist, designer, or technical topic (visualizations tools, algorithms, etc.) during the course of the semester. Presentations should be about 10 minutes long in the format of your choice (slideshow, website, mini-lecture).

You must submit an online summary of your presentation as part of your class documentation by the final day of class. This PDF or online summary should cover the main points of your presentation as well as including appropriate visuals and links to resources or additional information on your topic.

Presentation topics (and dates) will be chosen in the second week of class. One of your presentations must be drawn from a provided list of options while the other will be a topic of your own devising.

Grading

Participation: 20%
Final assignment: 15%
Presentations: 15%
Assignments 1–3: 10%
Reading Questions: 10%
Attendance: 2 unexcused absences==instafail

Final-grade Archetypes

F – frequently late and/or absent. insufficient participation. little to no understanding of formal and quantitative practices.
D – occasional lateness and more than one unexcused absence. basic understanding of subject matter.
C – occasional lateness. demonstrated an understanding of subject matter. failed to take risks. work holds together. makes only obligatory contributions to discussions.
B – always present. work in on time. demonstrated a solid understanding of subject matter. was able to seek out new design principles and technological approaches. work has good form and content (and took some risks). able to make interesting contributions to the class.
A – always present. work in on time. demonstrated a solid understanding of data visualization. was able to explore new approaches. work has excellent form and content which took major risks. always makes interesting contributions to the class and frequently led class discussions.