Monday, June 8, 2015

Bed & Breakfast Analytics Foundation #1: Burn the silos!

We are surrounded by skilled people. We are also surrounded by unskilled people. How do we manage such diversity? How can we be sure that our Data setup is the one you need? Discover my thoughts throughout this post, being the first one of a series of posts explaining the foundations of this blog. Want to read it? Avant!

Already as an experienced worker, in my first job as a BI and Analysis "manager" (at that moment I had no clue what a manager was because I never had good managers), I was first required to make an evaluation of their set up on the Data and Analysis department, although they felt very proud of their Business Intelligence efforts. The first question I asked, ironically, was: "Ah, do we have such department"? I was presented a Data Scientist, a Campaign Manager, a guy that was creating some dashboards in Excel, and a self-denominated UX expert. That was the marvellous team. After talking individually with them for a couple of days I came back to my Manager and said to him: "Sorry, you have no team. You have a set of skills but no cohesion between them. You built your Data strategy based on silos!". "What's your recommendation?", he said. The answer was the easiest one I've given in my life: "Burn the silos!".

Is my Data and Analysis setup the right one? Sorry, I would say: NO. I can bet my next month's salary that there are important flaws in the way you've organized your Data, BI, and/or Analysis team.
Before starting, let's make very clear what is probably the most important topic when building your Data team: there is no unique and/or magical solution. Every Data team should be built according to the Business needs, the company culture, the skills available at that moment, the available budget, and a long etc. I'm going to give you the classical consultant answer to the question of how to organize the Data team: "Depends". If you came here wanting to seek a magical solution, this is definitively not your place. Further more, this is not your best job. Building up the best team possible is a tough task, and, for sure, not an easy one. It will require a lot of trial and error and a lot of adjustments.
The basic outcome: what do you expect from your Data team?

I was told many times that the Data and Analysis team should give added value to the company. If you think that a Data or Analysis team should bring knowledge then I must say you're partially wrong. Knowledge is the path, not the mean. The ultimate end of tracking, analyzing, extracting knowledge, etc., is to create action! A feature in the website should not be tracked or analzyed if these actions bring no action to the business. The team should be able to trigger a change in the business when delivering knowledge based on an analysis. If there is no action out of these efforts, then the whole Data strategy is totally worthless.
Simplifying the lifecycle of a Data team, we could say that there are four main steps in an analysis strategy:



In first place we have the business needs. The team should be able to gather them, understand them, and forecast what they could look like, in order to deliver Actions proactively. This implies clear and crystal communication skills with the business owners. Once the need is clear, we proceed to evaluate the availability of the necessary data for the potential analysis (in a separate post I'll come back to this point). Then we analyze, out of which we obtain knowledge. Afterwards we come out with actions: we need to make sure that every single analysis done, every single report delivered, any single data extracted is actionable, in the sense of provoking some change, some further thinking on the management level, or, in the last-but-not-least case, a change of operations and/or strategical decisions. If an outcome is not actionable then, simply, should not take a single minute of the team's time. This is where many companies and set ups fail, and actually creates the sensation that there should be no further investments on the Data and Analysis team. As well, I'll come with further insights in following posts.

Now that, in a high-level point of view, we understand how the lifecycle of Data works it's time to understand that this process, as it's exposed right now, it's based on silos. Each step is traditionally done in independent ways: the web analyst specifies and validates tracking, the data scientist probably determines that a regression analysis is the most suitable to mine the data, the campaign manager states that some campaigns should be stopped, the reporting manager tries to gather-and-show all this information (after it's being generated), etc. All this process, again, has been built over a silo basis. With the process implemented this way is virtually impossible to extract real action out of it.

Here and now is where the Manager needs to step in, in order to wrap up these processes into a single one. Then, and only then, we could start talking about having a successful set up of our Data and Analysis team. When doing this for the first time, is very common to fall into one of the two following mistakes:

Excessive focus on Tracking+Analysis:


This is the most common mistake. Everything needs to be tracked and stored, said the CEO. There's not enough budget for that, said the CFO. There are not enough resources, said IT. There are not enough skills, said the analyst. There's not need for all that, should be saying the BI Manager. If you decide to track, store, and analyze absolutely everything, then I must say you have the data-version of the Diogenes Syndrome. You gather a lot of knowledge. You gather a lot data (which generates a cost that should not be underestimated). However, if there is no clear connection with the Business, we'll never be able to derive action out of this massive amount of knowledge. In other words, all this data and knowledge is totally unorganized and unstructured. The Manager should be able to gather concrete business needs, or, even better, identify clear business opportunities, and then (and only then) make sure the data is available and that a proper analysis is conducted. Actions will appear automatically after this process is properly followed.

Disconnection form the Business:


This is a very dangerous mistake. This set up is what I call the corpses set up. Deriving actions out of a massive tracking and analyze phase is like walking around by killing everybody you find in your path. If there is no connection with the Business (the motivation, the core of Data existence), the proposed actions will result in a worthless effort. Usually this set up ends with a generalized burnout of the Data and Analysis team. If the team (or its Manager) doesn't understand why are we tracking and analyzing, then the actions that we'll derive out of it will not answer any question. Let me depict this with a concrete example. As an analyst in a flash-sales online vendor I was asked by the COO: "For how long we should be running Promotional Sales campaigns? Please, tell me a number and I'll make sure that we'll implement such length". Indeed, was a very simple request, and it took me less than one day to come with an answer (actually, it was 4 days). As promised, the COO implemented by force such number, although nobody understood where that number came from. Even more, at the Data and Analysis team we ignored at all why we were running such campaigns. That is, we did not understand the essence of the request. In other words, the COO was assuming that we should have such campaigns "in order to generate unique buyers". A simple analysis afterwards revealed that such campaigns weren't generating more unique buyers, or that such buyers were spending more after a given time. At the end, by understanding what was the Business need (are Promotional campaigns worthwhile?) we were able to collect the necessary data and to conduct specific analysis to come with a clear action: kill Promotional campaigns. This was only possible by means of wrapping up all four silos in a unique and crystal process: from the Business need to the action.

A new way to understand the wrap-up.

The previous example showed that, indeed, we should consider all four stages as part of a single process. However, there is a way to further pressure the process: instead of considering it a linear process, consider it a cycle:



This is nothing more than having a proper follow-up of the implementation of the proposed actions. Ideally, such implementations will lead on more business needs, that, at the same time, will need more data and analysis in order to be optimal. Part of the Manager's skills we should count for proper feedback loops, implementation follow-ups, etc. This implies that very strong communication skills and technical and architecture knowledge are a must-have.

This cycle will automatically derive in chasing new business opportunities. In other words, when the team, and its manager, perform the proper follow-ups, and closes the cycle, then they will start to do analysis by themselves. Why? Because they will understand the business and check whether the implementation were a success or not.

Now this needs to come to an end. You can have the best business analysts, web analysts, campaign managers, data scientists, etc. etc. etc. But you will only succeed if their manager is able to wrap-up these silos, from the business needs to the implementations and assessment of the proposed solutions. It well worths a try. Guaranteed!

Tuesday, April 14, 2015

Bed & Breakfast Analytics: The 10 Motivations and the 10 Foundations

We are surrounded by data. What can we do with that? For what? How? What can we expect and what not? What are the common errors? Size matters? Open source or commercial tools? Here you should find some tips to discover your own journey on the data realm. Bon voyage!

I still remember a nearly-hilarious situation I faced in my first job as data analyst. The CEO came to me and asked me for tons of data very important for a strategic decision. Wow! Panic! Just graduated from College. Just landed in this job. No clue about the business. No clue about data structure. No clue about KPI names. No clue about anything. After some minutes of panic, I deeply breathed and tried to deliver what I've been requested. I promised I did the best I could: gathered data from different departments (no Data Warehouse, no unique source of truth) and different people, in very different formats, I used some advanced and fancy stuff in Excel, and, after 10 hours of intense job, I delivered a kind-of-report. I really had no idea what I was doing. I had no idea what data I was delivering. Some days after I went back to my CEO and asked him how useful my data was. His answer was: "Which data? Ah, that report. Well, we did not use it. We took the decision XXX based on a market research the CMO found in a blog". I suppose I should say thanks.

Sorry. Probably your data-setting is not correct. You should consider start reading this.

After years of experience, probably you've heard these stories many times. The Marketing Manager (random Manager example) requires some data. Let's depict some standard scenarios. The requested data corresponds to...

1. ... clicks, sessions, bounces, etc. This one should be easy. The Web Analytics Manager easily performs this task (is part of its basic skill set), probably by applying some complex advanced segments to the data (easy does not necessarily mean simple). Nowadays, the implementations of the web analytics tools trend to be very complex, mainly because they need to cover a lot of business cases. Simple, right? Well, now imagine that for some unfortunate reason, the Web Analytics Manager is on vacation. Panic! Then, the request is given to, let's say, a Campaign Manager. Of course, he/she has access to the web analytics tool, and hence tries to retrieve the requested data. A bit of panic appears, as the data seems not coherent (of course not, he's not applying those complex advanced segments he should be applying). He then tries to search for some documentation regarding this topics and...  surprise! he/she finds no such documentation. Finally he delivers some numbers, but they all know those numbers might not be totally reliable. At the end, and as the requests get more complex, the process to retrieve such data gets more complex as well. If the process is not clear enough for all stakeholders, the result is a lack of trust on the delivered data, leading to a lack on trust on the data strategy (if such thing exists in the company).

Here I already find my first three motivations:

  • Motivation 1: there is a lack of proper documentation. Information transference is virtually inexistent in many e-commerce companies, specially for data-related topics.
  • Motivation 2: business complexity translates directly into data complexity. Not every stakeholder understands this implication.
  • Motivation 3: wrong data strategy leads to lack of trust and, even worse, to wrong decisions. 

2. ... revenues, sales, etc. This one gets a bit trickier. The Marketing Manager pings somebody by BI, or by Finance. Traditionally the request is not complete or it's poorly written: time frames missing, before/after refunds, etc. Normally, such simple requests, requires 2-3 iterations, leading, again, to a lack of trust in the provided data. In some cases, a variation of this scenario takes place: reports and data is built by manually joining data retrieved from different data sources, as the full data map is not clear for everyone.

Again, two more motivations appear:

  • Motivation 4: it's very hard to write clear requests.
  • Motivation 5: outside our comfort area, finding data could be a challenge. Even when having a data warehouse, or a nice-and-expensive-but-totally-useless BI tool.
  • Motivation 6: it's easier to request than to retrieve, and it's easier to retrieve than to process.

3. ... data that has been already requested any time (many times?) before. This one is a quite disappointing. There is nothing more frustrating, in both directions, that performing a recurrent request, and being requested for the same time after time. Assume for a second that, indeed, such data is available. Why is data recurrently needed not easily available? Even worse, what if we have (as I mentioned in the previous paragraph) a very nice BI tool? Why some users are reluctant to use self-service data platforms? Now, assume that the data isn't available. Tough times are about to come: it's time to reach IT in order to start gathering this data. Normally, from a BI/Data department is very hard to write clear specifications for IT to start gathering some data, due to several reasons: lack of knowledge on the platform, lack of database architecture knowledge, etc.

With this, two further motivations appear:
  • Motivation 7: having a BI tool does not ensure self-service. Having a self-service platform does not ensure data availability.
  • Motivation 8: communication between BI and IT could be a struggle.

4. ... data, or analysis that we don't know whether it can be accomplished or not, or data which is not clear how is going to be used upon delivery. The first challenge when receiving a request, or when performing it, is to determine whether it can be done or not (assuming whether it makes sense or not). Many of the analysts work directly with data, without designing a plan for such analysis or request. That is, both requesters and analysts work without an analysis framework, even when it's clear that the analysis will require some time to be finished, probably due to its complexity. A different case appears when the request is coming from the CEO. We have to admit that is very hard to say no to our CEO. However, the CEO does not know everything, and he's not always right. Even CEO's requests need to be challenged, understood, and accepted.

With this, we find my two final motivations:
  • Motivation 9: working with analysis framework is a must-have.
  • Motivation 10: determine whether a request (for data or for an analysis) makes sense. Find the way to challenge every single request.

The Decalogue: the 10 Foundations of Bed & Breakfast Analytics

With my thoughts over the desk, and the motivations I find out of them, I'm ready to state my Decalogue.

1. Burn the silos! Managing data requires transversality and deepness on each vertical. Skill silos are not suitable any more.

2. Complexity matters! Understand how business complexity affects data complexity.

3. Better alone than... No data is better than wrong data.

4. Write, write, and write. Documentation is a must-have. Learn how to document and learn how to request.

5. Going beyond your comfort area. You should consider expanding your comfort area. Even more, you should consider not having any comfort area at all.

6. Bring order to chaos. Narrow your analysis: understand the need, design and framework, and only then, retrieve data.

7. Communication is the key. Your CEO does not care about regression models, decision trees, or how fast your database engine is. He wants a way to keep a sustainable and profitable business.

8. Choose wisely. The right tool for the right set-up. Self-service is not always the best solution.

9. Don't rush! Data is a path with some mandatory steps. Cheating leads to frustration and lack of trust.

10. So, how are you doing? Integrate data. Move aways from data silos. Design KPIs, reports, and dashboards based on integrated data.


So, what's next?

With all these, I want to share with you how do I measure, why do I measure and how do I analyze, with the hope that you will join me walking through the learning curve of the data-related world. In a Bed and Breakfast hotel you share your experiences with many others, and you obtain a clean and cheap way for sightseeing. This is exactly what I pretend to do here: every two or three weeks I will be sharing my thoughts, tips, tools, and techniques. Everything what I know will be shared. I'm willing to do so!

Hope you find this interesting, and welcome onboard!