Subscribe
 
Reading time: 8 min
PDF Logo Printable PDF

Brittle Points: How to make companies robust

Brittleness is when the company fails because just one component breaks. Learn some strategies for fixing Brittle Points.

Companies fail for all sorts of reasons. Sometimes we’re not surprised, like if the product never worked or no one cared about the problem it was solving.

But sometimes those things were great, and sales were starting to pick up, and then something broke, and it was all over. How does that happen?

  • The product was built on a platform, and the platform changed. A popular app drops to zero downloads after Apple builds it into iOS; a Google Workspace add-on drops to zero sales after Google builds that feature into Docs; a Twitter management tool breaks when Twitter removes functionality from their public API.
  • The initial marketing channel quickly saturated, so growth stalled at a non-zero but unsustainably-low rate.
  • The initial marketing channel was sustainable for a while, but got wiped away due to external forces: large bidders tripled the cost per click, Google’s SEO algorithm changed, the big industry event stopped happening, the link-sharing site became irrelevant, the hot blog lost its traffic, the magazine running the ads finally closed.
  • One big customer, representing 80% of total revenue, left. It wasn’t a mistake to sign that customer—it funded the company.
  • A key employee left the company. Early on, a 10x person can mint the company but also could be irreplaceable. A suitable replacement is too rare; it takes too long to find someone, convince them to join for almost no salary, and get them up-to-speed and productive.

I call these “brittle points”—places where sudden changes mean the company catastrophically fails, regardless how wonderful everything else is.

All young companies—and some mature ones—have dependencies like these. You can’t help it; you have to rely on other technology to build the product, services from vendors to deliver the product, and human beings to do work.

Let’s quickly see how engineers solve this problem with hardware (don’t worry, it’s not technical!), and that will give us clues about how to solve all the other problems in that list.

A brittle server

Suppose I have a single server that runs my website. Any number of things can cause this server to break—a power failure, a network failure, a bad configuration change, too much traffic arriving at once, bugs in the code, all sorts of things.

How do you make this system less brittle?

Consider power failure. Power can fail if the power supply1 inside the server burns up, or the power strip fails, or the power cord fails (maybe through a wetware failure like accidentally unplugging it).

1 This is the component inside the computer that receives the power cord; it converts city-power into the type of power needed by the other components.

We can address this Brittle Point by having a second copy of the power components—a second power strip with a second cord plugged into a second power supply. This is, in fact, exactly what data centers do!

In short: redundancy—having two things that do the same job. It’s twice as expensive, but it buys you robustness. This is also what you do with dashboard dials in airplanes.

But what happens if the city power fails? Data centers have their own gas-powered generators. Which means they also have to stock large amounts of gasoline. Gas-powered engines that are infrequently used have a tendency to stop working, so they also test and maintain those units. Data centers often have multiple generators. More robustness, purchased at significant, on-going expense.

In modern clouds we go yet another step further, because the entire data center is itself another Brittle Point. So we have additional, identical servers in a different, physically-separate data center that draws power and network connectivity from different vendors. But then you also need a system that knows how to direct traffic to only the servers that are currently available; another system! Which itself becomes a Brittle Point.

And this pattern continues—fix more Brittle Points, at more cost and complexity, sometimes creating new Brittle Points.

This example is applicable to all of the causes of failure above.

Neutralizing brittle points

“One platform” is brittle, because if they forward-integrate (i.e. copy you), or remove APIs that you need, or themselves fail, that’s the end of the company. One solution is to be multi-platform2. Another solution is to only build on platforms where you have a high degree of confidence that the platform owners are committed to supporting their ecosystem by never directly competing with them, and in fact promoting them. (Salesforce is currently the best in the world at this.)

2 Example: At WP Engine, we now run on all three major clouds—AWS, Google, Azure. Example: Instead of a tool for listening and responding on Twitter, you also support LinkedIn, Threads, and Mastodon.

“One marketing channel” is brittle, because if anything happens to it, that could be the end of an otherwise-healthy company. The solution is to layer on additional marketing channels, so that variation in any one of them is not fatal.

“One big customer” is brittle. One solution is a long-term contract with a serious breakup clause—insurance to bridge the time it will take to win replacement customers. Another solution is to prioritize sales until that customer represents a smaller percentage of revenue. Also up-front payments, so you have the cash-flow to invest in that growth right now. The typical attitude is, “We now have a large customer, so pour extra money into development to make sure we don’t lose it,” but the right attitude is to use that money to win other customers.

“One key employee” is brittle. Not only might they leave, but what if they just get sick or need to take a vacation? The usual refrain in the startup world is that none of these are options—everyone has to work 70+ hours/week to the exclusion of other things. Talk about brittle!

Solving these things takes time and money. Like the server example, they’re not free, and not quick-fix solutions. You can’t just hire three more fantastic developers to create a robust engineering team, and you can’t just snap your fingers and find three new efficient, productive marketing channels.

Therefore, the right attitude is to maintain a list of these risks and then periodically ask: Which one is best to work towards right now?

For example, it’s cheaper and easier to experiment with new marketing channels than it is to find, interview, convince, and manage a second software developer, and plus if you can get a second marketing channel online, that will generate revenue, which in turn means you can afford a second software developer. In this mini-scenario, the best thing is to focus all your energy on getting a second marketing channel working.

As you scale, the size of the “chunks” that create brittleness also scale, which creates new brittle things, and thus new risks and new investments. For example, with $260M in revenue in 2016, still growing at a blistering 60%/year, with a thousand employees, Hubspot was not brittle in any of the ways outlined above. But they recognized that they were a single-product company. At that scale, that’s a brittle thing: if there were a sea-change in the market for inbound marketing software, that could be fatal to Hubspot. It also limits long-term growth as the market matures and saturates. The way out—the redundancy—was to become a multi-product company, but not where one product is still 95% of revenue. They attacked that problem, and in 2024 it’s clear that they succeeded3.

3 See this article for data and discussion.

The biggest brittle point

Finally, on a personal note, there’s another “chunk-level” that’s even larger than all of the preceding, and it’s a brittleness that almost all founders suffer from, including myself. The chunk of “the entire company.”

This is a one reason why founders are almost always sad and sometimes permanently depressed even after a successful sale of a company. This was your identity, your life, for years. You don’t remember what it was like before this was you. You don’t have hobbies or even good friends anymore. You might have sacrificed family or health. Talk about a Brittle Point. Your entire life is a Brittle Point.

The solution here is not to have two companies or two jobs. That’s burnout; a lack of singular focus creates worse outcomes.

Rather, the solution is to realize that there were things you did and loved before and there will be things you will do and love after. They might not all be the same things. Sometimes it’s best if they’re not; you’re a different person now. You are inside a chapter in the book of your life. Even if one chapter is sad or has an unexpected twist, there’s the next chapter which you can look forward to, even if you don’t yet know how that story will unfold.

Robustness, not in many things simultaneously, but in things serially. That’s the wrong attitude for solving tactical problems at work, but it’s the right attitude for the arc of your life.

Back to today and the here-and-now. Go list all the Brittle Things you have today. Then tackle one or two of those things at a time—you have to manage risk, not try to eliminate all of it at once.

Be thoughtful, and build steadily away from brittleness.

☞ If you're enjoying this, please subscribe and share this article! ☜