On Systems and Strategy

A few months back, my wonderful and brilliant former colleague Bud Caddell posted a public kickoff for his Bucket Brigade project, and since then I’ve been thinking of some of the central themes that he presented.

Chief in my mind has been the idea that we have an incomplete understanding of the causal relationships within the systems we create [and the larger systems that our systems exist within (and so on)].

Which leads to water-cooler discussions that follow predictable patterns:

“Well, it would have worked if only ________ did ________.”

“The campaign should have been successful, but ________ totally shat the bed.”

So while I know Bud’s working on a much more expansive look at a similar topic, I’d like to explore the features of adaptable systems and put forward four key things that will help you design strategies that don’t suck. Or at least strategies that are built to succeed and get better over time.

Over a weird few months of “research,” diving into some pretty esoteric source material, four things came up that could/should be part of your work process and feature prominently in your deliverables:

  1. Clear, loud feedback & indication of system status
  2. Intentional flexibility for and planned reactions to failure and success within the system
  3. Agreed-upon indicators for the success or failure of components within the system
  4. A recognition and articulation of interconnectedness within the system

If you can honestly say to me that you’ve got all four of those bases covered, I’ll put money on your recommendation. If not, good luck.

Aside
For what it’s worth, I think the best strategists and thinkers do a lot of this instinctually, but precious few show their work. So what follows could possibly just be a method for making the way you understand/decide/act more explicit to those around you, be they clients, coworkers, or loved ones.

Clear Feedback & Indication of System Status
Like many geeky types, I imagine, after the Fukushima disaster I was inspired to learn more about nuclear power. That traipse through Wikipedia brought me to the above gem: a status spreadsheet for the Fukushima Dai-Ichi power plant, published here on a daily basis. The above is a map of the system a few days after the Sendai earthquake, indicating the different levels of function for key parts of the reactor facility.

For each reactor, the following parameters are listed:

  1. Operational Status
  2. Core/Fuel Integrity
  3. Reactor Pressure Vessel Integrity
  4. Core Cooling req. AC Power (injections of water into core)
  5. Core Cooling req. AC Power (heat exchangers)
  6. Building Integrity
  7. Water Level of the Reactor Pressure Vessel
  8. Pressure of the Reactor Pressure Vessel
  9. Containment Vessel Pressure
  10. Spent Fuel Integrity
  11. Cooling of Spent Fuel Pool
  12. Main Control Room Habitability & Operability

Each of these has a note and a color-code based on its significance. The reds are the ones that are severe issues and need immediate action. All of the above are essentially KPIs for nuclear reactor safety, and some are part of a cascading tree of safety relationships. Those relationships are shown in the image below (click it to enlarge). This particular situation/example isn’t really a tremendous definition of a clearly understood ecosystem of causes and effects, but it’s interesting anyway.

I’m not here to make any commentary on nuclear power, TEPCO, JAIF or any other related thing. I will say that I like the way the status is visualized here; it’s reminiscent of something my dad told me about: the “christmas tree” on a submarine.

This awesome photo (credit Telstar Logistics on Flickr) is of the christmas tree on the USS Pampanito. The lights indicate whether or not the hatches and valves on the boat are fully shut. If they’re all green, dive away. If they’re red, and you dive, you’ll sink and die a terrible death. Having hatches and valves shut is pretty important for a submersible.

Both the Fukushima table and the christmas tree indicate system status while focusing attention on critical areas. I think some sort of system qualification scheme for strategy should do the same.

The complicating factor for strategy is that many of the potential indicator lights that you could create wouldn’t be fed with clear information. It’s easy to tell if a valve or hatch is closed, and there are sensors in a reactor that can tell you about pressure levels and the like.

But what if the thing is difficult to measure? And while I’m sure that in the future everything will be measurable, in the near term some things just aren’t economical to quantitatively understand.

Agreed Indicators of Component Pass/Fail
Take dog shows for example. When it comes to deciding which pups best exemplify their breed, it’s not yet feasible to implement some sort of digital system that measures whether or not a Dachshund is more balanced than a Golden Retriever. But it is feasible to agree upon what constitutes the ultimate Dachshund or Golden, write it down, and hand that out to judges and breeders. In the case of disputes, suitably detailed descriptions should help reasonable observers and participants know with confidence why one dog won and the other lost.

Seems to me you could do apply that logic to nearly every action in a system.

Switching gears only slightly to tennis.

When you’re looking for players to practice with to improve your game, it’s important to find people at or around your skill level. Too great a skill difference and it’s only fun if the better player is getting paid for their time.

But aside from having every serious/semi-serious tennis player in the world play each other and report their scores back to a governing body to establish handicapping and rating, it’s pretty difficult for two tennis players that don’t know each other to understand how they stack up. So the United States Tennis Association (USTA) created something called the National Tennis Rating Program (NTRP).

The NTRP breaks down the game into six key areas that create success. And in each category, it provides a thorough description that makes it simple to self-assess. For example:

  1. Forehand: Very good consistency; uses speed and spin effectively; controls depth well; tends to over-hit ondifficult shots; offensive on moderate shots
  2. Backhand: Can control direction and depth but may break down under pressure; offensive on moderate shots
  3. Serve/Return of Serve: Aggressive serving with limited double faults; uses power and spin; developing offense; on second serve frequently hits with good depth and placement; frequently hits aggressive service returns; can take pace off with moderate success in doubles
  4. Volley: Can handle a mixed sequence of volleys; good footwork; has depth and directional control on backhand; developing touch; most common error is still overhitting
  5. Special Shots: Hits approach shots with good depth and control; can consistently hit volleys and overheads to end the point
  6. Playing Style: Very good consistency; more intentional variety in game; is hitting with more pace; covers up weaknesses well; beginning to vary game plan according to opponent; aggressive net play is common in doubles; good anticipation; beginning to handle pace

Performing up to those descriptions gets you a 4.5 rating. Interestingly, at the upper end of the table – 5.5 to 7.0 – they get more holistic, and indicate that the player probably doesn’t need a rating, as they’ll be playing competitively and their ranking will be more easily judged by their success against peers.

So it’s possible to blend multiple qualification methods into one system.

Ultimately, when it’s difficult or expensive to connect data to a success/fail judgment, know that there is another way: clearly defining success characteristics, and allowing for self- or peer-evaluation against those characteristics.

Flexibility for and Planned Reactions to System Fail/Win
Last Christmas, my wonderful parents gave me my grandma’s KitchenAid stand mixer. It’s a gorgeous, reliable machine, but was a little dirty. As I researched the process for cleaning the mixer, I found the above image, depicting (among other things) something called a “Fail-Safe Gear”. It’s the grey gear in the lower-right, and it’s made out of plastic. It’s designed to strip out if the contents of the mixing bowl get stuck, preventing the motor from overheating.

Genius.

It seems crazy at first to introduce a part that is designed to fail into a system, but when the components upstream of that part are expensive, hard-to-maintain or hard-to-replace, it makes a ton of sense. And while this is an engineering solution, to be sure, it’s a clear example of an established reaction to a common failure that prevents damage to the majority of the system’s components.

Another example of this (again from tennis) is the way ball marks on clay courts allow for clear challenge-resolution. No need for Hawk-Eye replays to determine whether a ball was in or out. Just check the mark.

But perhaps the most interesting example of this that I’ve found is the system of flight laws that exist in modern fly-by-wire aircraft (Airbus A320-on and Boeing 777-on). In older, simpler planes, control surfaces, engines and other important systems are controlled mechanically. In more advanced planes, pilot control inputs are fed through a set of computers that relay potentially modified commands to the engines and control surfaces, actively preventing bad shit from happening. When everything is operational, the pilot can’t accidentally or purposefully induce a stall, over-speed, or excessively roll the plane. Which is totes a good thing.

Warning: bad things happen to good systems.

Here’s how.

While all the systems in these modern planes are impressively redundant, a set of well-defined Flight Control Laws exist to cede control of the plane directly to the pilots as systems break down. Unfortunately, though, many pilots either aren’t prepared for managing this cession of control or aren’t well informed when the system breaks down.

Great example: AF447.

After diving into the murky waters surrounding the Air France flight 447 crash, I discovered this page outlining the parameters of all the laws programmed into Airbus’ flight control systems. In the case of AF447, almost all the systems had broke down, putting the plane into “Direct Law” where the pilots were in complete physical control of the aircraft. Where the plane would normally be protected from excessive Angle of Attack (AOA), excessive speed and lost lift, all of which seem to have contributed to the crash of the AF447 crash.

Unfortunately, though, the systems did break down: the speed-sensing-by-way-of-pressure-differential pitot tubes iced over, and without that crucial data point, the Airbus A330 began to — for lack of a better term — shut down, leaving the pilots completely in control of an airplane going too fast for conditions and rapidly approaching a stall.

It’s still a mystery what exactly happened in the cockpit, but the flight data recorders indicate that the pilots didn’t quite know what to do. It was the middle of the night, they were flying through storms that impacted their sense of direction, and they were receiving conflicting information about their attitude. The plane’s systems shut down to give them complete flexibility to return to a salvageable course, but the pilots either didn’t have the experience or the information necessary to do so.

The two principal complaints on the aviation boards are pretty cutting:

  1. A simple, and cheap GPS system would have been a serviceable backup system for the onboard computers
  2. More stick-in-hand manual flying time for long-haul pilots (apparently the guys at the helm had a total of 13 hours of unaided flight time in the A330/340 platform) is a necessary precaution for an automated system where the backup plan is to hand over control to the pilots

One cheap solution and one expensive solution, both of which I’m sure were considered and left out. Sad.

Several of the examples I’ve used here are from safety-first industries where systems thinking is an absolute necessity. The specifics of the Fukushima and AF447 scenarios are excellent examples of situations where the real-life conditions exceeded the maximum tolerances that the system was designed to accommodate. But it’s important to remember that the systems work pretty darn well most of the time; for their huge inherent risk, both nuclear power generation and large-scale commercial travel are exceptionally safe.

So how do we apply these lessons to the business of creating strategy where the intent is to sell more stuff?

Recognition and Articulation of Interconnectedness
The last, and perhaps most important step in creating a system that isn’t prone to breakage is to understand and describe how the component parts affect and are affected by each other. This has the fantastic potential upside of helping create a system that gets better under stress rather than worse.

One of the best examples of a clearly articulated system map came to me from the previously mentioned UC alumnus. It’s the Ryanair business plan, expressed as a network of interconnected nodes. Each node on the map is a critical part of their business – some important decision they made about how they were going to operate – and where appropriate the nodes are connected with directional, annotated arrows describing the affects certain decisions have on other decisions.

Funny thing: I’ve never seen this employed by anyone with the word strategist in their title. Nor have I seen it on the cube walls of the many offices I visit. Nor has it been provided in source material on the projects I’ve done. All this in the face of the fact that it’s a central part of the way that some dude called Porter understands strategy.

My assumption is that this doesn’t get done for two reasons:

  1. Talking about interconnectedness and pass/fail conditions runs contrary to the way that most businesses operate, no matter their scale. Most places – especially marketing organizations – just don’t do feedback well.
  2. It’s complex. Complexity is just plain difficult to deal with, difficult to understand, difficult to plan around. Doing all these things well would require a strong command of data, strong internal commitment to shared goals, and a belief in organizational adaptability. Name an organization that has all of those things throughout its ranks.

Predictably, most of the examples of effective interconnectedness of disparate systems/nodes are found in nature. And to my knowledge, we’ve had a pretty piss-poor track record of replicating the natural, successful complexity.

So, what next?
So: implement these methods. Parse the systems you create into specific nodes, and explain how they interact. Write down and agree to qualification schemes that allow you to understand when hard-to-measure things are successful or not. Build in failsafes that allow you to survive failure.

But most of all, make it clear, make it simple, make it explicit.

Comments

10 Comments so far. Leave a comment below.
  1. anon,

    so i wonder where the money went for the book he didn’t write?

  2. Come now, Bud’s still working on it.

    Books take a long time to write.

    Also, if you want to be mean, be mean with your name attached.

  3. Absolutely fascinating post Clay – superbly researched, and some great examples. I’ve thought long and hard and read up a bit on the AF447 disaster too. Have you read the Checklist Manifesto by Atul Gawande? That particular case looks like it was a black swan, but I believe checklists can solve a lot of problems.

    Thanks for the post!

  4. First off, thanks for the kind words and for pursuing this topic. It needs volume in our industry, and to Anon’s comment, I’m not breaking any speed limits getting the book out there.

    Anon, to answer your question as direct as possible, I still have just over 88% of the funds still in a money market account. So far I’ve taken two interview trips and have paid a research assistant weekly to help out. I’ve tried to do everything as cheaply as possible knowing I need to make every dollar count. Since I started writing I’ve had to make 2 cross-country moves, drum up freelance work, and take a demanding new job in a brand new city. If you funded the book, I apologize for the delay, but there were no time stipulations in the funding process, and first books just tend to take a long time.

    You obviously don’t know me well, personally. I take my commitments to other people very seriously. I’d ask you to not publicly judge the character of someone you don’t know well, especially hiding behind the veil of anonymity.

    • And just to be 100% clear, I haven’t spent a dime of the money given to me to write the book on anything that wasn’t directly related to writing the book. The only expenses so far have been travel related to interviews, a research assistant’s time, and source materials I’ve purchased. I hope that answers your question.

  5. Epic post. This deserves way more comments. People: comment like it’s 2008 already! Especially UC folks…

    I’m a bit confused by what you/bud mean by “systems.” I understand the idea of feedback, success criteria, graceful failing, and complexity. To me, these feel like things deployable to, I guess, anything. What makes them unique to systems?

    The 447 crash really intrigues me, especially given our conversations about the automation of transportation. It speaks, I think, to the inability of a system designer to account for every possible scenario. While their are back-ups to back-ups, I imagine that some freak accident based on a random permutation of conditions could result in catastrophe. I wonder if there’s a way for human-knowledge to play a great role–where the pilots wouldn’t be so surprised when a fully-automated plane decides to shut down.

    The action map doesn’t get done because it’s so freaking high-level. Unless you’re starting a new company, I’d imagine that it’s something crafted over decades.

  6. Thanks, Clay!

    Synchs up beautifully with where I am in Adapt by Tim Harford – Chapter 6: Preventing Financial Meltdowns or: Decoupling.

    What interests me about applying this thinking to our work is the challenge of creating safe places to fail. In my mind, this is the most difficult challenge standing in the way of companies adapting to a digital environment.

  7. Nice piece Clay.

    Of your “four things”I think point #2 is increasingly important. You can plan for response but it might create activity over there or back in that link. We all need to be quick on our feet.

    For #3 Any thoughts on assessing relative health of each part of the system where some just work or not. Maybe their quality is hard to assess? Like a wire… It’s just a vessel for the activity.

    Do you find you re-construct these as you go and learn and how are clients/ the senior team at UC in adjusting/ re-thinking?

    Cheers,

    NTW

  8. Nice post.

    One thing you did not mention that i think is important when considering complex adaptive systems (i.e. most of humanity) is the lack of predictability. Because you never quite know how one element of a system will predict another, it’s hard to predict with any real degree of certainty whether something will work. It makes a lot more sense to try 20 small ideas than it does one big one and to create an iterative process for seeing what works.

    Unfortunately, most agencies still seem fixated on the big idea not the many small ideas

  9. It would be really interesting to see these 4 concepts applied to our personal relationships…where the success criterion is how happy you are with someone (romantic, or just friendly, or even with family.) What could be a failsafe “submarine valve” for your relationship with your girl/boyfriend? I happen to experience this: When our written communication (over BBM, chat) exceeds our live, real life communication, bad things occur. We start not to understand each other well even on mundane diaogues. Wouldn’t it be nice to have an app (sorry for overusing the word on behalf of the internets) to remind you to talk to someone, when your mutual-chattiness level is exceeded? Hmm..Lemme drink some tea now, have a nice evening.

Add Your Comments

Disclaimer
Your email is never published nor shared.
Required
Required
Tips

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <ol> <ul> <li> <strong>

Ready?