How to analyze unfamiliar data: circle, dive, and riff

When you come face to face with unfamiliar data, how do you proceed? How do you avoid sending yourself and your shiny “speed of thought” tool slamming into a dead end? Dan Murray’s got a routine — and he’s also got certain music and right-brained books to go along.

Dan’s first rule: “Don’t pre-think.” It’s the hardest thing for people to learn, he says. “If you go into [data analysis] thinking you know where you’re going, you easily miss the granule of gold.”

He’s the chief operating officer and heavy-hitting data analyst at InterWorks, Inc., an Oklahoma-based business consultancy. What seems to me like an unending stream of mid-size businesses from all different industries has kept him running days, nights, and weekends to make sense of each one’s data and unravel old data knots.

From an airport somewhere in the South, he explains, “You have to think like a writer thinks. You don’t know where the story’s going to go.” Screenwriters and novelists often say in interviews that their characters veered off in directions the writer hadn’t anticipated.

He’s been analyzing data ever since spreadsheets first became available in the early ’80s. “I was a huge spreadsheet guy.” Now his tool of choice is Tableau.

The routine goes something like this.

First, get the big picture. Grasp the general outline. How many records do you have? What’s the highest and lowest? For example, if you’re looking at a company’s sales, how many sales, units sold, and so on?

Look for what pops out. Trends often make themselves obvious right away.

Find groups. Build a bar chart to see how it all breaks down. If you’re looking at sales, make groups of products, divisions, for example.

Lay out timelines. Build time series to see any long term trends. Start simply with years, then break it into more detail.

Make maps. If the data contains locations, throw it on a map and see what clusters appear.

Go on tangents. Try making some measures into dimensions. For example, if you have a million invoices, with a range of up to a million dollars, where do most invoices fall? Try cycling through every type of chart. Remember, the cost of any view is just one click.

Look into outliers. Outliers may be just bad data, or they may be interesting. A good place to find them is in scatterplots. “Most of my interesting discoveries are in scatterplots,” says Dan. Seemingly unrelated numbers sometimes have some kind of interesting correlation.

Combine. Put all the charts done so far into one dashboard. Filter all the views based on [things I highlight]. There you can see it all at once. Brains don’t remember more than one or two things at one time, but here you see it all together.

Repeat. Good tools make false steps easy to back out of.

Keep an open mind. He plays music, often the piano music of Frank Kimbrough, such as”The Spins.” He emails, “The lyrical and circular notions of this song reflect how I do analysis. He circles, he dives, he riffs, and then he comes back and does it again in a slightly different way.”

Present and persuade. Jazz, right-brain thinking, motivation, surprise, discovery — it all results in discoveries that must be communicated persuasively for any value to result. Dan recommends the two books by Dan and Chip Heath, Made to Stick and Switch.

Three hours of analysis will show you plenty. “You’ll know just as much as the insiders know.”

Do you have a routine for analyzing unfamiliar data? I’d especially like to hear from users of many different tools, from the most advanced to pencil-and-paper. Please introduce yourself here.

3 Responses to How to analyze unfamiliar data: circle, dive, and riff

The data industry thrives on conversation. Please submit a comment.

Other recent posts

Bad stories stop good data at the water cooler

We agree by now that data’s a good compass. One neglected question is tougher: Which map? Everyone’s known the kind of “grouchy guy” TDWI instructor Kellee M. Franklin, Ph.D tells about. This guy knew better than most of his co-workers about how their Washington, D.C. defense agency worked. And he was frustrated. Over the years,… Continue Reading

Tableau plus HyPer: “Something up their sleeve”

Yet a third reaction has come to Tableau’s announcement that it has acquired HyPer, the German “high performance database system.” “It seems sort of muddled,” wrote Dave Wells in email to me yesterday. He’s a longtime Tableau observer, a 40-year IT veteran, and now a consultant and educator at Infocentric. For six years, he was… Continue Reading

As if our data didn’t have enough problems

The assessment of worldwide threats issued yesterday by director of national intelligence James Clapper has one more topic for panel discussions at data-industry events. The Worldwide Threat Assessment of the US Intelligence Community says the Ruskies, Chinese, and others will “almost certainly” try to do more with our data than to steal it. Future cyber… Continue Reading

“Big changes” in storytelling: simple and simplistic

The storytelling world shook this morning with this headline from Tableau: “Data storytelling is undergoing a big change.” The blog post lists three changes: scrolling with less clicking, simpler charts, and visualizations that weave into the narrative. What is really changing? Not much, and to call it “big changes” is worthy of a trashy tabloid… Continue Reading

Andy Cotgreave on data without emotion

Tableau’s senior technical evangelist Andy Cotgreave has boarded the data storytelling wagon. Actually, I don’t know how long he’s been there, but an article he wrote caught my attention today. He says that data without emotion is “worthless.” I agree! Consider also the terrible Syrian refugee crisis affecting the Middle East and Europe. This tragedy… Continue Reading

Notable marketing: Have imagination, will be read

For all the marketing collatoral the data industry produces, there’s little that I can read without forcing myself. But when the good stuff comes, it’s like a gust of spring air blowing into a stuffy room. That kind of marketing blew into Datadoodle headquarters Friday morning. VisualCue, maker of visualization software done with “tiles,” won… Continue Reading