By clicking “Accept All Cookies”, you agree to the storing of first-party and third-party cookies on your device to enhance site navigation, analyze site usage, and assist in our marketing efforts. View our Cookie Policy for more information.

Jumpstarting the data engine for startups

Samaipata Editorial Team
September 3, 2024

Written by the Samaipata Editorial Team

This is the fifth episode of "No-Filter Talks," an interview-based video series that invites our Operating Partners to speak openly about their experiences, offering valuable lessons and sometimes hard-to-swallow truths for our founders' success.

In the fast-paced world of startups, navigating the complexities of data management is critical, yet often misunderstood. Emmanuel Martin-Chave, VP of Data at BlaBlaCar and Operating Partner at Samaipata, brings a wealth of experience in scaling data strategies for high-growth companies. With a background that includes leading data teams at BlaBlaCar and contributing to the development of data-driven cultures across various organisations, Emmanuel is well-versed in the challenges and opportunities that data presents to startups.

In this episode, Emmanuel shares his insights on when and how startups should prioritise building a data team, the importance of aligning data initiatives with business objectives, and practical strategies for keeping data management simple yet effective. For any founder or startup professional grappling with data-related decisions, Emmanuel's expertise offers a roadmap to making data a powerful tool for growth.

Samaipata: Does an early-stage startup need a data department?

Emmanuel: An early stage startup, I think, doesn't need,  or I'd rather say it cannot afford a data team from the start.  If you have money, spend it on engineering, product sales,  because at the end of the day, you're finding your product market fit  and that at the beginning, you anyway don't have a lot of data to analyze.  Now, I think what's a great question is like,  when do you need to start looking for a data team?  When do you need to start making that a priority?  Who will you give that priority to?  Is this going to be engineering, finance, another department?  And you need to have that in mind because it takes time, it's expensive.  So if you don't know that, you risk building it too late.  That's when troubles start.  Another thing is people should be relaxed about not having a data team at first  because you can do tons of things with a couple of spreadsheets,  just like common sense,  and just asking the engineering team to export some data for you  and that you can go to great things with that.  

Samaipata: What strategies do you use to build this data driven culture?

Emmanuel: The two things that come to mind,  one is,  the engineering department who eventually is going to produce the data  through the product, directly through the databases and so on,  they need to be aware this is coming  so that they can anticipate a bit and make decisions.  For instance, are we going to be event-driven?  Are we going to be database-driven?  What's our architecture?  And start thinking ahead,  okay, how will we build the platform, the data platform on top of that?  The second part is more in the business,  the operations side.  People need to start thinking about,  okay, what kind of data would help me solve that problem?  And that shifts the mindset.  Very often, give me everything you have and I'll use what comes up.  I don't think it's the right mindset.  You should start with a more minimalist approach of,  okay, here is my problem.  Here are the two, three KPIs I'll be looking at.  And that creates a better culture because you avoid asking  15,000 questions.  Okay.  You don't want to ask a question to your data team,  which will always be swamped, never give you answers. 

Samaipata: How do you measure the success or effectiveness of your data initiatives?

Emmanuel: It starts with who's asking the question and what problem they're trying to solve.  One thing I've found interesting is that it's very often super hard to compute  the ROI of a data team as it is for any support function.  What's the ROI of your finance department?  You mostly see them as a cost.  And that's it.  So what I recommend doing is more, okay, what problem am I trying to solve with data?  Is this going to be, I know I'll make a better product decision because I can roll out the right feature.  I know I can contact the good leads in my pipeline so that my sales will spend less time on the ones that won't convert.  I'm optimising operations, so I'm saving time, money, et cetera.  But focus on that problem.  Maybe you'll find a way  to put a Euro, dollar, like a monetary amount on it.  If you can focus on that specific thing, you'll find a metric to track it.  It could be the time people are spending.  It could be how many wrong decisions you made.  It could be how fast are you making the, are you evolving your product?  And that is usually fairly simple to measure.  The area where it's easiest to measure the efficiency of data projects is usually when you automate the data.  So you can do that.  And then you can do that.  And then you can do that.  And then you can do that.  And then you can do that.  And then you can do that.  So for instance, you use AI to automate moderation or you build a chat bot so you receive less tickets.  Most of the things that data teams do, it's fairly hard to put a good ROI on it.  So my recommendation would be don't go too much on the details.  Focus on what impact you have in your operations and stop there.  

Samaipata: What's your approach to data literacy within the organisation and how do you ensure everyone understands the database and data systems they're working with?

Emmanuel: My approach is people need to understand the high level blocks in the system, not necessarily all the specifics.  Have everyone clear on what the company KPIs are.  Is it revenue?  Is it cash?  Is it the number of clients?  Is it the number of contracts?  Everyone should know them.  Leadership should use them all the time so that education is repetition, so that they stick into everyone's mind.  Where is it produced?  Internal, external sources, third parties.  Manual operations.  And I'm not talking about listing everything, but at least that they know, okay, all of our data is internally produced.  We got nothing from the outside.  Or 50% of our data comes from a third party.  Or we buy 90% of what's very useful to them, to us.  The second one is consumption.  So who's using data in the company?  And that's fascinating because very often you have a siloed approach about the company.  So you know what you are doing.  You don't necessarily know well what others are doing.  You don't necessarily know well what others are doing.  So you don't know how they use data.  And the last part is people need to be aware when there are systems in the middle between the production and the consumption.  And that would be, okay, who's processing the data?  Who's responsible for what?  It's super important because one of the common difficulties of data teams is that they are usually the middle system.  And they get squeezed between the consumer that tells them, hey, your data is broken.  It's wrong.  Why didn't I not have my data this morning?  And the producers that sometimes just like to break things without knowing that they dump stuff into a hole and they don't really know how it's used.  Usually they don't consume their own data.  So that creates a bit of a problem.  How do you manage your data?  

Samaipata: What are your top three pieces of advice for a startup founder looking to get their data operations up and running?

Emmanuel: Keep it simple.  It's important because today in the market you have like 15 different tools to do any single operations in data.  And very soon you can end up with a lot of data.  And if you end up with a super complex stack to maintain, then you need people to administer them and to connect them and to debug them and so on and so on.  So a good example is that you can go pretty far by having a daily or a monthly extract of your databases put into at the very beginning, put into a few spreadsheets and build some reporting on top of that, like fairly simple graphs and so on.  

The second one is about: okay, what are your objectives?  You'll have a lot of data. You'll have limited resources, both in terms of money, people, time.  You'll have a lot of constraints because data is integrated in the whole system.  And so it's super important to be clear on, okay, in this phase for the next nine months, what we're going to do is focus on the foundations.  And so we are going to take relatively little tickets, requests on, hey, can you compute that number?  Can you compute this analysis?  And maybe then it's going to be okay.  We need to do something new that we didn't before.  It could be streaming real-time data.  It could be machine learning.  But you need to be clear on, okay, we're going to focus on a single problem that's going to eat up 60, 70% of our time.  The rest is going to be run, like operations.  But we're going to do one problem at a time.  The risk otherwise is that you just like to go all over the place and you don't have enough time to complete your projects.  

And probably the last is to allow yourself to fail because that's going to happen.

Latest News

See also

More insights to better the world through technology

The Founders Q&A

The Founders Q&A

As we strive for transparency at Samaipata, we want to shed some light on our investment thesis and on some of the key questions entrepreneurs often ask us. If the answers laid out below fit with what you’re looking for from a potential VC investor, we would like to hear from you! We review all opportunities sent to us.
Read more
AI is shaking up the software market, for the better

AI is shaking up the software market, for the better

The software industry will have to bring the power of AI to the broader economy and face a collective innovators dilemma in doing so. Traditional SaaS solutions were originally designed to simplify specific tasks within job descriptions and boost overall productivity. They serve as tools that augment human work but ultimately require a workforce to operate them.
Read more
The Hive Summit 2024: Samaipata Founders' Retreat

The Hive Summit 2024: Samaipata Founders' Retreat

Every year at Samaipata, we host our Founders' Retreat—The Hive Summit—under our Founders’ Success Platform, "The Hive." Celebrating its second edition this year in sunny Mallorca, the Summit brought together the founders of our portfolio companies, operating partners, and some of Samaipata's close friends. This retreat is designed to strengthen the bonds within our community, fostering meaningful connections and facilitating collaboration among all attendees.
Read more
Fueling startups with smart engineering

Fueling startups with smart engineering

In this episode, Stephan delves into strategies for optimising engineering productivity while keeping technical debt in check. He emphasises the importance of fostering collaboration between product and engineering, maintaining a focus on business value, and using key performance metrics to drive continuous improvement.
Read more

Leveraging the power of PR for startup growth

In the competitive landscape of entrepreneurship, the strategic deployment of public relations is essential for standing out from the crowd. Nadini Sathiyarajan, a communications expert and Operating Partner at Samaipata, brings a robust background from her leadership roles at companies like Spotify and Qonto, and is currently the Global Communications Lead at Sorare—a blockchain-based fantasy sports platform that has raised over $700M. 
Read more

Mastering the art of Sales at early-stage startups

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros.
Read more
arrow icon
arrow icon