    One Dataset, Five Line Charts

    Updated on: Dec 15th, 2015
    Data Visualization
    HIV infection rates displayed in a small multiples area chart

    I recently partnered with a group of public health researchers.

    We were brainstorming different ways to display the number of new HIV infections in each subgroup. The organization wanted to compare HIV infections among men who have sex with men (MSM), African Americans, Hispanics/Latinos, and injection drug users (IDUs) between 2006 and 2010.

    As always, there are several correct ways to display these infection numbers.

    Option A: A Traditional Line Graph

    This is the traditional way to display the data: a classic line chart.

    All five years are displayed, 2006 through 2010, and all four subgroups of people appear within the same chart.

    But this isn’t the only way to display that dataset.

    HIV infection rates displayed in a line graph

    Option B: A Slope Graph

    Another option is a slope chart. A slope chart is basically a line chart that only shows two points in time.

    You might only have two points to display, like pre and post data. Maybe that’s the only time you collected data.

    Or you might brainstorm with your teammates and decide that your viewers don’t need to be distracted by all the peaks and valleys in the dataset. You might decide that your viewers really need to focus on the big picture: in general, did things go up, or did things go down?

    I’m usually a big fan of slope charts, but I don’t think a slope chart works best for this particular dataset. The lines almost look flat – like barely anything happened during that timespan. And maybe that’s the story you’re trying to tell – that HIV infection rates were relatively stable during this time period. But if you’re trying to emphasize that there were huge increases or decreases, then a slope chart isn’t the right choice for this dataset.
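    If you'd like to check how flat the lines will look before you build the chart, here's a minimal sketch of the data prep in Python. It keeps only the first and last year of each series and reports the percent change between them. The group names, the numbers, and the function names `to_slope_points` and `percent_change` are all made up for illustration; they are not the real infection counts.

```python
# Hypothetical placeholder counts, NOT the real HIV surveillance figures.
series = {
    "Group A": {2006: 100, 2007: 110, 2008: 105, 2009: 102, 2010: 108},
    "Group B": {2006: 80, 2007: 78, 2008: 75, 2009: 74, 2010: 72},
}


def to_slope_points(yearly):
    """Keep only the earliest and latest year of one series."""
    first, last = min(yearly), max(yearly)
    return {first: yearly[first], last: yearly[last]}


def percent_change(yearly):
    """Percent change from the earliest year to the latest year."""
    first, last = min(yearly), max(yearly)
    return (yearly[last] - yearly[first]) * 100 / yearly[first]


# One two-point series per group: this is all a slope chart plots.
slope_data = {name: to_slope_points(values) for name, values in series.items()}

for name, values in series.items():
    print(f"{name}: {slope_data[name]} ({percent_change(values):+.1f}%)")
```

    Small percent changes are a hint that the slope chart will look flat, which is fine if stability is your story and a problem if it isn't.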

    HIV infection rates displayed in a slope graph

    Option C: A Small Multiples Line Graph

    A small multiples line chart helps us disaggregate the four different subgroups.

    This style would be useful if you wanted to add subtitles or annotations above each graph. For example, you might type “MSM” and then below, “HIV infections rose slightly in 2007, decreased in 2008 and 2009, and then increased again in 2010.”
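    Before writing an annotation like that one, it can help to list the direction of each year-over-year change. Here's a rough sketch in Python. The `describe_changes` function is a hypothetical helper, and the numbers are placeholders shaped to follow the pattern in the example sentence above, not real data.

```python
def describe_changes(yearly):
    """Return (year, direction) pairs comparing each year with the year before."""
    years = sorted(yearly)
    changes = []
    for prev, curr in zip(years, years[1:]):
        diff = yearly[curr] - yearly[prev]
        if diff > 0:
            direction = "rose"
        elif diff < 0:
            direction = "fell"
        else:
            direction = "held steady"
        changes.append((curr, direction))
    return changes


# Hypothetical placeholder values, NOT the real infection counts.
example_series = {2006: 100, 2007: 104, 2008: 98, 2009: 95, 2010: 101}

for year, direction in describe_changes(example_series):
    print(year, direction)
```

    The wording of the annotation is still your call. The list just keeps the sentence honest about which years went up and which went down.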

    HIV infection rates displayed in a small multiples line chart

    Here’s a twist on the small multiples line chart. It combines a technique called color highlighting with the small multiples design.

    I use this approach when I want my viewers to see the contextual information, like the number of new HIV infections for men who have sex with men (the blue line on the first chart) compared to HIV infection rates for the other groups.
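    The logic behind color highlighting is simple enough to sketch in a few lines of Python: every panel contains all four lines, but only one line per panel gets the highlight color. The hex colors and the `panel_styles` function are assumptions for illustration, and the output is a plain description of each panel that you would hand to whatever charting tool you use.

```python
HIGHLIGHT = "#1f77b4"  # hypothetical blue for the focus line
CONTEXT = "#cccccc"    # hypothetical light grey for the other lines


def panel_styles(groups):
    """Build one panel per group, with that group highlighted and the rest greyed out."""
    panels = []
    for focus in groups:
        colors = {g: (HIGHLIGHT if g == focus else CONTEXT) for g in groups}
        # Draw the grey context lines first so the highlighted line sits on top.
        draw_order = [g for g in groups if g != focus] + [focus]
        panels.append({"title": focus, "colors": colors, "draw_order": draw_order})
    return panels


groups = ["MSM", "African Americans", "Hispanics/Latinos", "IDUs"]
panels = panel_styles(groups)

for panel in panels:
    print(panel["title"], panel["draw_order"])
```

    Drawing the highlighted line last is a small design choice that keeps it from being hidden behind the grey lines where they cross.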

    HIV infection rates displayed in a small multiples line chart with shading

    Option D: A Small Multiples Area Graph

    When you’ve only got one line per graph, another option is to transform your line chart into an area chart. An area chart shades in the area underneath each line.

    This is a really popular design because people say it’s easier for our brains to see that big, filled-in, dark shape than to spot the skinny line all by itself. The skinny line on the line chart is competing with the axes and the grid lines for our brain’s attention. But there’s no way you can look at this graph and not spot those big blue chunks right off the bat.

    HIV infection rates displayed in a small multiples area chart

    Option E: A Small Multiples Slope Graph

    A fifth option for this dataset is a small multiples slope graph.
    Remember, slope graphs have two points in time. So we’ve just got 2006 and 2010 HIV infections here.

    HIV infection rates displayed in a small multiples slope chart

    The takeaway message is that there are at least a dozen correct ways to visualize each of your datasets. It’s your job to sit down and sketch the options on paper, weigh the pros and cons of each layout, and then choose the layout that matches the message you’re going for.

    Which style do you prefer for this particular dataset?
     
     
     
     

    More about Ann K. Emery
    Ann K. Emery is a sought-after speaker who is determined to get your data out of spreadsheets and into stakeholders’ hands. Each year, she leads more than 100 workshops, webinars, and keynotes for thousands of people around the globe. Her design consultancy also overhauls graphs, publications, and slideshows with the goal of making technical information easier to understand for non-technical audiences.
