I need information, now what do I do?
If you’ve ever been asked for information to support work you are doing or perhaps, to keep people informed of your efforts, you may have struggled with the process to define and deliver the information needed. If you identify with this issue, this post will cover three key questions you should answer to help set your requirements so that you can get to your information goal quickly.
What’s the Problem and Why Do I Care?
Our starting point is based on a quote from one of my favorite professors. “What is the problem and why do I care?” His point was that you have to be clear on the problem and what you are trying to address if you are going to be successful in formulating a solution. Clearly understanding the problem will enable you to be effective in gathering the right information for your answer. You should be able to state it clearly and easily.
Some examples are:
- What is the overall cost and schedule status of my project?
- Are my people overbooked in the fourth quarter?
- What factors are impacting my project’s delivery?
Otherwise, you are going to waste time pulling together data while looking for a question to answer.
What are you doing with the information?
The second question relates to how the information will be used. Essentially, the purpose will help drive the how to best structure the outcome. Many people fail to identify this aspect correctly, resulting in information that is not structured properly to meet the need. I’ll cover each of these outcome types in detail in subsequent posts.
Most likely, you are doing one or more of the following to either draw a conclusion or to illustrate a point. While these are broken out as distinct entities for illustration purposes, in many cases you will be using a combination of techniques.
In this example, we are attempting to understand why sales are so dismal in the North region for November.
When we look at sales data by region or we break down the number of hours entered against a project by each person, we are doing Aggregation. We may also create synthetic groups to aggregate data based on some attribute of the underlying data. In many cases, simple number charts are used to convey the data.
In the example above, we are looking at the November Sales numbers for the Blue, Green and Red Sales teams by region. The Red team seemed to do well in November, at least according to this view.
Typically, if you have aggregated data, it is likely that you need to compare and rank the groups of data. The Stack Rank is a very common scenario where you are ranking the data by Best to worst, based on some criteria. Number charts, bar charts and to a lesser extent, pie charts are commonly used for comparisons.
In this example, the West region actually had the most November sales of any region. The Red Team, leader on the previous view, actually sold the least amount in the West Region.
Here’s why the problem statement is important. Without having a clear definition of the problem, it isn’t apparent which answer is correct, as different conclusions can be drawn from the same data.
Another fairly common usage of information is to illustrate the composition of the data. In this scenario, we may be attempting to determine which region has the most salespeople. When used properly, a composition can be used to quickly convey relevant data. We see that the East and North Regions are staffed with a smaller number of salespeople than the other regions. This may give us a clue as to why sales are lagging in the North.
Another interesting information analysis you may choose to do is to understand how information changes over time. This type of visualization allows you to understand the direction of progress, beyond the current state, enabling you to determine which items may be more worthy of your attention. For example, projects that are late but are trending back to being on plan may be better off than a late project which is trending later.
In this example, we see that sales in the North region are flat and actually beginning to decline. In a real investigation, we would likely dig into this trend further since all other regions are growing.
Variation / Distribution
Another way to visualize the data is to visualize how the data varies for a given period. In many cases, the temptation is to only look at aggregated values or averages, but sometimes it’s the distribution of the data which tells a more compelling story. Readers who have a statistical background will be very comfortable with this type of information as distributions, variances and other such items are core to statistical investigation.
In this case, we see the majority of deals for November are small deals, with a second peak. This view would also provide a wider view of what’s happening. Do we have a training issue? Have vendors decided to cut back on orders due to the economy? Are there other factors at play? Without this view, these questions may not have been asked.
The last information type is to map out relationships. If you are deriving information from people relationships, you might here the term “social graph”, which is one way to construct, visualize and consume relationship data.. Relationship maps may uncover potential dependencies between items like people, which are not normally covered by work management and financial management tools.
In our example, one item jumps out of the data, in that the North region is covered out of one office. As the other growing regions are covered out of multiple regions, there may be collaborations on how to approach a customer that aren’t happening in the Chicago office. These collaborations may be resulting in more, small sales in the other offices. Further research is warranted but you should consider relationship mapping as part of your information arsenal.
What behavior do you want to occur, as a result?
One last aspect to consider is what story should the Information you gather tell? One way to determine the form of the story is to decide what behavior you want to occur as a result of the information you’ve gathered. Targeting specific behavior helps you decide in which fashion the data will be presented.
For example, if the focal point of the your information is to ensure certain upcoming tasks are completed on time, you may determine that a stack rank of incomplete tasks, ranked by days until Due Date, is the best way to present the data.
In the example used in this post, your intent may be to get another salesperson hired in the Atlanta office for the Blue team to support the North Region. You are able to illustrate a sales decline, some potential reasons for it and you would have to present where investment could improve the situation.
Three aspects that can be used to determine your information requirements were covered in this post. We’ll dive deeper into these and other relevant information in future posts.