
Data storytelling: promoted but barely researched

Data storytelling struck a chord with the data industry when Tom Davenport, of Competing on Analytics fame, gave it his blessing two years ago. Storytelling was a data scientist’s most important function, he said. Enthusiasts responded with columns, papers, and blog posts — but most of it did little more than echo Tom.

Fine, we agree that data storytelling is a good thing. But we still have scant observation of the actual practice of data storytelling — hard, direct evidence of what has worked, what hasn’t, which tools work, which career paths produce good practitioners, and other questions. I don’t even see consensus yet on what a data story is.

I intend to gather answers from as many business storytellers, whether current, past, or future, as I can reach.

What I want to know

  1. What is the definition of “data story”? How is data storytelling different from traditional storytelling?
  2. What characteristics do successful storytellers have?
  3. What frustrations or false starts have storytellers had?
  4. What career paths have produced successful storytellers?
  5. What are the most successful techniques? What are the least successful?
  6. What tools have proven the most helpful?
  7. What organizational cultures have proven conducive?
  8. What are any observed benefits of data storytelling?

Do you tell data stories in business? Have you tried or do you hope to try? Do you know anyone who fits this description? Please contact me here with your info or leads.

Google guys come to shake up BI with natural language

Kindergarten may have taught you all you need to know about life. But you may need to watch “Mr. Peabody and His Boy Sherman,” an animated, 1960s-era TV series for kids, to truly appreciate an interesting new natural-language product called ThoughtSpot.

ThoughtSpot’s natural-language querying represents a new stage of maturity for casual BI users — a step up even from visualization, which was a step up from rows and columns. ThoughtSpot users get data with natural language queries, otherwise known as questions.

ThoughtSpot opens with a Google-like interface. In a demo, a search for “total revenue last year” quickly returned visualized data: a simple dollar amount and, below it, bar charts with some detail. A follow-up query asked for a breakdown by age and gender, and a line chart showed the total revenue broken down that way. During what period? The query “last year quarterly” yielded yet more detail.

As in Google, queries type ahead based on past queries by others. And not just any others; the tool learns from the group. “Revenue in California,” for example, will show up first for one group while, say, “Revenue in Caledonia” will show up for another.
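To picture how group-aware type-ahead could work, here is a minimal sketch in Python. It is an illustration only, not ThoughtSpot's actual implementation, which the demo did not reveal. The class `GroupTypeahead`, its methods, and the group names `west_sales` and `pacific_ops` are all hypothetical. The idea is simply to count each group's past queries and rank completions by that group's own counts.

```python
from collections import Counter, defaultdict


class GroupTypeahead:
    """Hypothetical sketch: rank completions by a group's own query history."""

    def __init__(self):
        # group name -> Counter of normalized past query strings
        self._history = defaultdict(Counter)

    def record(self, group, query):
        """Log one past query for a group."""
        self._history[group][query.strip().lower()] += 1

    def suggest(self, group, prefix, limit=5):
        """Return the group's past queries that start with prefix, most frequent first."""
        prefix = prefix.strip().lower()
        counts = self._history.get(group, Counter())
        matches = [(q, n) for q, n in counts.items() if q.startswith(prefix)]
        # Descending count first, then alphabetical so ties are deterministic.
        matches.sort(key=lambda item: (-item[1], item[0]))
        return [q for q, _ in matches[:limit]]


# Assumed sample history: two groups with different habits.
engine = GroupTypeahead()
for _ in range(3):
    engine.record("west_sales", "Revenue in California")
engine.record("west_sales", "Revenue in Caledonia")
for _ in range(2):
    engine.record("pacific_ops", "Revenue in Caledonia")

west = engine.suggest("west_sales", "revenue in cal")
pacific = engine.suggest("pacific_ops", "revenue in cal")
```

With this sample history, the first group sees the California query ranked first, while the second group sees only the Caledonia query. A real product would presumably also weigh recency, the individual user's history, and the underlying data model.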

Inevitably, I suppose that many business users will try to do more analysis than they’re capable of. I can’t tell how deep ThoughtSpot will take them. But suppose they get into a problem and suppose expert help eventually arrives.

This is where I imagine that Mr. Peabody appears. In the TV cartoon, he was a brilliant, geeky dog, and Sherman was his pet boy. Sherman always had the interesting, pertinent questions. Mr. Peabody had enigmatic answers. “But Mr. Peabody,” Sherman often began, “why did they call it ThoughtSpot?” Mr. Peabody often gave a reply like this: “It’s elementary, my dear Sherman. It’s ‘thought’ for ‘thought’ and ‘spot’ for ‘spot.’ ThoughtSpot!” For me, the pair offer an allegory for data scientists and their business users.

“How did you get those numbers?” the Mr. Peabody type might ask. To show him, the ThoughtSpot user hovers the cursor over a search term to reveal the table and column supplying that data. If that’s not good enough, a little window headed “What am I looking at?” explains in a sentence.

The two ThoughtSpot representatives who conducted the demo proudly told me about one such encounter, in which the Peabody realized he’d been using incorrect data.

In the old TV cartoon, the dog is always smarter than the smartest human. Peabody and Sherman routinely travel back in time to render help at precisely the moment necessary to let such figures as Albert Einstein discover relativity or to align an apple to fall on Isaac Newton’s head.

If only His Boy Sherman could have sent the WABAC (wayback) Machine forward, instead of backward. Perhaps then he would have found a tool to let his innocent, unfiltered questions iterate toward insight with just natural language.

True, many of the world’s Shermans have already found that tools like Tableau and QlikSense let them iterate through questions and answers. But even such easy-to-use viz tools still don’t work for the many people who lack the patience or self-confidence to learn them.

ThoughtSpot, say the two people who showed it to me, connects to just about any source of data, on the cloud or on premises, including Hadoop. It creates its own in-memory relational cache, though it doesn’t create its own aggregations. It makes an index of the data while retaining the original schema, joins, cardinality, etc.

If ThoughtSpot resembles Google, it should. That’s where four of the seven founders came from.

“We’re not a bunch of BI guys trying to bolt search onto BI,” said vice president of marketing Scott Holden. “We’re a bunch of search guys trying to reinvent BI.” They just might do it, even without a Mr. Peabody to swoop down from the future with his helpful paw. ThoughtSpot has that rumble and hiss of an invention about to break through the BI industry’s frontier. It doesn’t seem intended to replace heavy-duty data analysis, at least not for now. But it does look like the easiest entry so far, lowering the ramp just enough for critical first steps, and maybe much more.

In fact, the most important implication of the Google-like interface may have seemed too obvious to mention: Using it takes no training. Nearly anyone can do simple data analysis immediately.

Sherman might ask, “But Mr. Peabody, if I can do this, why do I need you?” I’ll bet that the smartest of the Mr. Peabodys would have thought they’d never see the day. Ah, liberation! They might be overjoyed to just fetch the data and to come when called. Natural balance will have been restored.

Learning from earthquake relief to design BI tools

Rescue dog. Humble, helpful.

You might say I’m crazy to see any connection between some big IT deployments and typical responses to big natural disasters, but that’s what I see. It fits a theme that recurs across many disciplines: big interventions versus smaller, more humane, and often more effective efforts.
