Making systems

The concept development phase involves artifacts such as the customer objectives and concept of operations. We can now define each of these artifacts, but first we will address document management as a necessary supporting capability.

32.2.1 Document/configuration management

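Each of the artifacts worked on and produced in the concept development effort should be placed under document management. The document management system should provide organization, version management, and controlled approvals and workflow, along with a few supporting capabilities; each is discussed below.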

A project should establish a document management system early in the initial concept phase. The concept will be represented in the artifacts listed below, and when these artifacts are reviewed and approved, a baselined version of each should be available in the repository.

Organization. Ideally, a project will designate one tool for storing all electronic information, and organize the documents stored in that tool so that it is convenient to find each kind of document. In practice most projects use different tools for different kinds of artifacts—a source code management system for software, a document system for ordinary documents, design repositories for hardware designs.

A document does no good if the people who need to use it cannot find it. A project must provide a single starting point for finding documents, whether those are stored in a single tool or spread over multiple tools. The contents of each repository must be well organized; we have too often seen projects build up a long, long list of documents, each with a document number and unhelpful title, requiring users to scroll through the list or guess at search terms. Creating an index that organizes the artifacts by the relevant phase and component helps people significantly.

Repository organization takes effort. We recommend making at least one person explicitly responsible for maintaining the organization in the repository, maintaining indexes, and (if necessary) updating the organization to address how people actually use it.

This means that the repository should:

Provide a single starting point for anyone looking for an artifact
Provide indexes or organization that guides a user to the artifact(s) that they need
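As an illustration only, an index of this kind can be sketched as a grouping of artifact entries by phase and component. The field names, phase names, and locator strings below are hypothetical and not taken from any particular tool; the point is that one index can cover artifacts spread over several repositories.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class ArtifactEntry:
    """One row in the project-wide index (all field names are illustrative)."""
    title: str
    phase: str       # e.g. "concept development"
    component: str   # e.g. "whole system"
    location: str    # hypothetical locator: which tool holds it, and where


def build_index(entries):
    """Group entries by (phase, component) so users can browse rather than guess search terms."""
    index = defaultdict(list)
    for entry in entries:
        index[(entry.phase, entry.component)].append(entry)
    return dict(index)


# Example entries; the artifacts may live in different tools.
entries = [
    ArtifactEntry("Customer objectives", "concept development", "whole system", "docs:/concept/objectives"),
    ArtifactEntry("Concept of operations", "concept development", "whole system", "docs:/concept/conops"),
    ArtifactEntry("Controller source code", "implementation", "controller", "scm:/controller"),
]
index = build_index(entries)
concept_titles = [e.title for e in index[("concept development", "whole system")]]
```

The index is the single starting point; each entry's location says which tool to open next.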

Versions. The tools for storing artifacts must be able to maintain both a baselined version and multiple working versions.

The baselined version must be clearly identified as the baseline, so that people know what the official document is. A baselined version must also be immutable: it has been approved as a stable version. The baseline version should be replaced when a new version is approved as the baseline.

Working versions, on the other hand, can be updated often. People store working versions in a repository for multiple reasons: to preserve a copy of the work in case their local copy is lost or damaged, to share work in progress with others, and to provide a version as a proposed new baseline. People may be working on different changes concurrently—one person addressing one change, while another person works to address some other issue.

This means that the repository should support:

Multiple working versions, for different people and for different concurrent work on updates
- Working versions should be easily mutable, so that someone can store updates regularly
- Working versions should be sharable between people, both for shared updating and for comments and review
A single, clearly-identified baselined version of each artifact, separate from any working versions
- The baselined version should be frozen
- The baselined version can be replaced by a new baseline from time to time
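The version requirements above can be sketched as a small data model. This is a minimal illustration under stated assumptions, not a description of any real repository tool: the class names, the working-version labels, and the `replace_baseline` method are all hypothetical, and approval is deliberately left out here.

```python
from dataclasses import dataclass, field, FrozenInstanceError
from typing import Dict, Optional


@dataclass(frozen=True)
class Baseline:
    """An approved version; frozen so its content cannot be edited in place."""
    number: int
    content: str


@dataclass
class Artifact:
    name: str
    baseline: Optional[Baseline] = None                     # at most one current baseline
    working: Dict[str, str] = field(default_factory=dict)   # working versions keyed by label

    def save_working(self, label: str, content: str) -> None:
        """Working versions are freely mutable: saving again overwrites the draft."""
        self.working[label] = content

    def replace_baseline(self, label: str) -> Baseline:
        """Replace the baseline with the named working version (approval is assumed done)."""
        content = self.working[label]
        number = 1 if self.baseline is None else self.baseline.number + 1
        self.baseline = Baseline(number, content)
        return self.baseline


conops = Artifact("Concept of operations")
conops.save_working("alice/add-function", "draft A")
conops.save_working("bob/fix-timing", "draft B")            # concurrent, independent work
conops.save_working("alice/add-function", "draft A, revised")
first = conops.replace_baseline("alice/add-function")

try:
    first.content = "tampered"     # editing a frozen baseline is rejected
    baseline_is_frozen = False
except FrozenInstanceError:
    baseline_is_frozen = True
```

The baseline is replaced as a whole rather than edited, which matches the requirement that it be frozen yet replaceable from time to time.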

Approvals and workflow. The team relies on the integrity of baselined artifact versions. Any updates to the baselined version should, therefore, be carefully controlled. The typical workflow is that someone develops a working version of the artifact, then proposes it for a new baseline. The proposed version then gets reviews, and is either approved to become a new baseline or is given issues that need to be addressed before it can be approved. Once approved, the proposed version is promoted to become a new baseline.

Every project needs to have a clear, written procedure for this workflow. It should be clear to every team member how they go about proposing a working version to be baselined, how the review and approval steps are performed, who is responsible for approval, and the steps required to turn a proposed version into a new baseline.

Some artifact repositories provide support for these workflows. Software repositories, for example, provide functions to create branches (working versions), and to control the process where a branch is merged into the master branch (baselined version). Other tools provide a general workflow functionality that one can use to implement and enforce these steps.

We have seen some projects that do not use automated workflows, instead having a well-documented manual procedure for each of the steps. While this can be error-prone and while it does mean that one or more people must be responsible for managing the repository contents, this approach works well as long as the team is not too large and no more than a few dozen artifacts are being managed. This is especially useful when a project is starting up and has not yet determined what tools they will be using.

Finally, we noted earlier that sometimes it is important to update the baselines of several artifacts at once so that they stay consistent with each other. For example, consider when a customer requests a new function be added to the system. The new function must be added to the customer objectives document. The customer objectives and the concept of operations will then be inconsistent: the objectives will include the function, but the CONOPS will not. Someone will then need to update the CONOPS to add the function, followed by reviews and approval. It can be best to baseline the updated customer objectives and CONOPS documents at the same time, once they have both been updated and the updated CONOPS has been approved.

The repository, thus, should:

Have a clearly documented workflow for updating baselines

It is desirable, but not required, that the repository:

Provide automated support for the workflow of proposing a baseline, getting reviews and approvals, and updating the baselined version
Provide a way to update multiple artifacts together
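The workflow described in this section can be sketched as a small state machine. The status names and the `baseline_together` helper below are assumptions made for illustration; a real project would define its own states in its written procedure. The helper shows one way to update several artifacts together: nothing is promoted unless every proposal in the group has been approved.

```python
from enum import Enum


class Status(Enum):
    WORKING = "working"
    PROPOSED = "proposed"
    CHANGES_REQUESTED = "changes requested"
    APPROVED = "approved"
    BASELINED = "baselined"


# Allowed transitions in this illustrative workflow.
ALLOWED = {
    Status.WORKING: {Status.PROPOSED},
    Status.PROPOSED: {Status.APPROVED, Status.CHANGES_REQUESTED},
    Status.CHANGES_REQUESTED: {Status.PROPOSED},
    Status.APPROVED: {Status.BASELINED},
    Status.BASELINED: set(),
}


class Proposal:
    """A working version of one artifact moving through review."""

    def __init__(self, artifact):
        self.artifact = artifact
        self.status = Status.WORKING

    def move_to(self, new_status):
        if new_status not in ALLOWED[self.status]:
            raise ValueError(f"cannot go from {self.status.value} to {new_status.value}")
        self.status = new_status


def baseline_together(proposals):
    """Promote a group of proposals only if every one of them has been approved."""
    if not all(p.status is Status.APPROVED for p in proposals):
        return False
    for p in proposals:
        p.move_to(Status.BASELINED)
    return True


objectives = Proposal("Customer objectives")
conops = Proposal("Concept of operations")
for p in (objectives, conops):
    p.move_to(Status.PROPOSED)
objectives.move_to(Status.APPROVED)
conops.move_to(Status.CHANGES_REQUESTED)                  # review found issues
first_try = baseline_together([objectives, conops])       # refused: CONOPS not approved
conops.move_to(Status.PROPOSED)
conops.move_to(Status.APPROVED)
second_try = baseline_together([objectives, conops])      # both promoted together
```

The same table of allowed transitions can back either an automated tool or a manual checklist.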

Other considerations. The previous sections have outlined the functions that a repository should provide. Effective artifact management requires some other capabilities.

These capabilities include:

Access control. Different people will have different privileges while using the artifacts; one person might have approval authority for one kind of artifact, the authority to edit working versions of another kind, and only the authority to read yet another.
Security. The team spends effort and money developing the artifacts. They should be protected against being tampered with, lost, or leaked without authorization.
Disaster tolerance. The artifacts should be protected against non-malicious damage, such as a server crashing or a work site being lost in a natural disaster.
Audit and non-repudiation. For high-assurance system development, where safety or security of the developed system is important, the ability to review after the fact every change made to every artifact, and to identify with certainty the person who made the change, is usually required. This places a high bar on the security of the repository tools, one that few readily available tools support easily.
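To make the access control item concrete, here is a minimal sketch of privileges that differ by artifact kind. The person, the artifact kinds, and the grant table are invented for illustration; real tools express this in their own configuration.

```python
from enum import Flag, auto


class Privilege(Flag):
    READ = auto()
    EDIT_WORKING = auto()
    APPROVE = auto()


# Hypothetical grants: (person, artifact kind) -> privileges held.
GRANTS = {
    ("pat", "customer objectives"): Privilege.READ | Privilege.APPROVE,
    ("pat", "concept of operations"): Privilege.READ | Privilege.EDIT_WORKING,
    ("pat", "software design"): Privilege.READ,
}


def is_allowed(person, kind, needed):
    """True only if every needed privilege has been granted; the default is no access."""
    granted = GRANTS.get((person, kind), Privilege(0))
    return (granted & needed) == needed


can_approve = is_allowed("pat", "customer objectives", Privilege.APPROVE)
can_edit_design = is_allowed("pat", "software design", Privilege.EDIT_WORKING)
```

One person holds approval authority for one kind of artifact, edit rights on another, and read-only access to a third, as in the description above.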

In addition, the repository will work in conjunction with issue tracking or change order management tools. Those will be discussed in a later chapter.

Sidebar: Life cycle of a document

Each of the documents we have discussed will go through a sequence of steps:

The development of the initial version. In this step, the document will be changing regularly as its contents get worked out.
Initial baseline. This is the step where the document is treated as “finished”. This step often involves reviews and approvals. People can treat the baselined version of the document as stable; developing derived artifacts from the document is low risk because it isn’t in flux.
Identification of a potential change, and development of a revised document that addresses the change. During this step, two or more versions of the document will be available to the team: the most recent baselined version, and work-in-progress versions that are in flux.
Baselining a revised version. The revised version gets a review and approval, and is marked as the newest stable baseline version.

At any time, there is at most one most recent baseline of a document.

Every project needs to have two capabilities to support these changes: configuration or document management, and a way to work out the effects of a change. We will discuss these in an upcoming section.

Part VIII: Specifications

Chapter 33: System purpose

33.1 Introduction

A system’s purpose is the set of things that it should achieve for different stakeholders (Chapter 9). The purpose is a record of the reasons for investing resources and time into making the system. The purpose gives direction for all the work developing a concept and design for the system. A system is acceptable if it meets its purpose; anything in the system that does not contribute to the purpose is waste.

Developing the system purpose is part of the reference life cycle, as discussed in Section 28.3.

The development of every system and every component should begin with establishing its purpose. Understanding purpose is the first step to working out the specifications for a system or component. Trying to develop a concept or specification without working out purpose usually leads to developing the wrong thing. In Section 9.2, I laid out why it is important to work out purpose first.

The purpose for an entire system is likely to be complex, and it will take time and effort to find out the purpose. The understanding of the purpose for a system—or a complex component of it—is likely to evolve as time goes by, because people won’t realize everything they want all at once. The full understanding usually comes iteratively, as stakeholders review system concepts and discover something they missed.

Some components have simple purposes. The purpose for a bolt, for example, is just to connect some physical components and transfer force between them. Knowing what those components are, what they are made of, the environment the assembly will be in, and what forces the bolt must handle are most of the purpose. This kind of purpose does not need a drawn-out investigation or a complex record. It does still need to be established and written down, however.

The input to developing purpose is usually vague and unorganized. It includes general ideas for a system and who might be interested. It proceeds through discussions with potential customers and background research on them. It can include market research when a system might be built to sell to as yet unidentified customers. It can also include documents from a potential customer, such as a request for proposals (RFP).

The output of purpose development is a set of documents and other records. The primary outputs are a statement of who the system stakeholders are and what they need or want. The output also includes the unedited, uninterpreted records of what customers or stakeholders have said, so that the final record of purpose can be checked against the original information. Maintaining a record of how the team has learned to interpret what a customer said also helps check the final result.

The purpose is different from a specification or requirements. The purpose is a collection of statements of needs and wants. It will become the source for requirements, but the purpose is more high-level than requirements and does not try to have the same precision. The purpose is usually imprecise where specifications are precise and verifiable. Requirements specify everything necessary about a system or component, while purpose is limited and does not reflect decisions about system or component structure.

Parts of this process will repeat as concept, specification, and design proceed. When parts of the purpose change, people need a clear indication of the difference between the old and new purpose so that they can determine what needs to be changed in concept, specification, or design to meet those changes.

33.2 Why do this?

I have been told on more than one project: “We can’t afford to do this up-front stuff. We need to move fast and get something working.”

My first response has been along the lines of: “Moving fast to get where?” The second response is: “You clearly have some idea of what you want to build. What is that idea that you already know?”

There are three reasons to take the time at the beginning of a project to work out the system purpose.

There are occasional exceptions to each of these, but they are not common for complex systems.

The Pontiac Aztek SUV offers an example. It was an automobile marketed by General Motors from 2001-2005 (model years). It was designed to appeal to customers in a new market segment for smaller SUVs, and the company put effort into making it new and exciting. It was a failure for GM: it sold poorly and never met break-even sales. It was often described as one of the ugliest cars ever produced.

The company went ahead and produced the vehicle. It has been reported that the team worked hard to develop the model faster than had been typical of the company at that point. By some accounts, the project’s execution went smoothly and met its milestones. Nonetheless, all the effort was in vain.

The Aztek project shows all three reasons for spending time working out purpose. The project believed they knew what people wanted better than the customers they surveyed did, and they were wrong. The company failed to build relationships with potential customers who were telling them that the car was not attractive. And once the car was available, the public stigma was so great that even the changes they made in subsequent years and marketing efforts did not help sales.

This example is far from unique. I discussed other projects that did not fully work out the system purpose in Sections 4.1, 4.3, and 4.4.

There is one case where the principle of establishing a customer’s purpose for a system does not apply: when building a new kind of system for which there is not yet a customer. For example, a technical demonstration system is used to learn about the technical problems of a potential system design approach before committing to making a product using that approach. The Boeing 367-80 (the “Dash 80”) in the early 1950s is a good example [Thompson87]. It tested a design that evolved (separately) into the 707 passenger aircraft and the KC-135 military tanker aircraft, but neither of those aircraft was exactly the Dash 80. The Dash 80 tested new wing and engine pod designs, thrust reversers, landing gear, and more. The Dash 80 also gave flight demonstrations to airlines that showed that the aircraft was up to the task of carrying passengers economically.

Technology demonstration projects still have stakeholders, even if they do not result in a system that will be used directly by customers. Funders are still providing resources to build the system, and they expect a return on that investment, such as learning enough that the organization can build a “real” version as a product. Regulators may have a stake in how the system operates. The project may have to make a case to the funder or others in order to get the investment for the work. In Chapter 26, I discussed making the case for a project.

33.3 Different kinds of projects

The purpose is all about knowing what the customer and other stakeholders want. But who is the customer? How does one learn what they want? There are multiple answers to these questions, and how one learns about purpose depends on those answers.

33.4 Investigating purpose

The process of coming to understand the purpose for a system has a few steps. These do not proceed neatly linearly; they overlap and repeat. The purpose of the process is to identify all the stakeholders involved in the system, and to come to an accurate and complete understanding of what each of them needs. The process of understanding each stakeholder itself has multiple steps. After one works out what each stakeholder needs for themselves, the last step is to synthesize a common purpose out of all of the stakeholders’ needs.

The task of understanding a stakeholder’s needs will only generate useful results if the results are accurate. This means that the team is careful to separate what the stakeholder actually says from what the team thinks they will be saying. It means that the team does not fill in gaps in what the stakeholder says; they may ask the stakeholder to fill a gap in, but the team does not supply the information. Most of all, the team works to avoid confirmation bias: they treat their evolving understanding as a hypothesis and devise ways to check whether the hypothesis is correct.

There will be the temptation to get too specific and try to make specifications while investigating purpose. Don’t do that; at this point in a project, the task is just to record what the system should solve. This information will get combined with a concept and more investigation with stakeholders to make specifications. Some customer needs will be rejected as not something this project should address. The needs from multiple stakeholders will be reconciled. All this happens before getting into details of concept or specification.

33.4.1 Who are stakeholders?

The first step in working out the purpose for a system or a component is to identify who the stakeholders are. A person or an organization can be considered a stakeholder if they can say “no” to the project or to the system and stop its progress, or if they are providing resources to the project to build the system.

I discussed stakeholders in Section 16.2. That section provides a general list of stakeholders and their potential as a starting point for working out the stakeholders for a particular project. That list includes the customer, the team, the organization, funders, and regulators as example stakeholder roles.

It is helpful to record the list of stakeholders, not just think them through. It is also helpful to record who or where to get information about each stakeholder. For a customer, this might be the list of contacts who are available to answer questions. For the organization running the project, this might include pointers to organization policy documents.
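A recorded stakeholder list can be as simple as the sketch below. It assumes the criterion given earlier in this section (a stakeholder can say "no" or provides resources); the field names and the example entries are hypothetical. The small helper flags stakeholders for whom no source of information has been recorded yet.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Stakeholder:
    name: str
    role: str                  # e.g. customer, funder, regulator
    can_say_no: bool           # can stop the project or the system
    provides_resources: bool   # funds or otherwise resources the project
    information_sources: List[str] = field(default_factory=list)


def qualifies(candidate):
    """Apply the criterion: can say "no", or provides resources."""
    return candidate.can_say_no or candidate.provides_resources


def missing_sources(register):
    """Names of stakeholders with no recorded contact or document to learn from."""
    return [s.name for s in register if not s.information_sources]


# Invented example entries.
candidates = [
    Stakeholder("Example customer", "customer", True, True, ["list of customer contacts"]),
    Stakeholder("Safety regulator", "regulator", True, False),
    Stakeholder("Trade press", "observer", False, False),
]
register = [c for c in candidates if qualifies(c)]
to_follow_up = missing_sources(register)
```

Here the regulator is kept in the register but flagged, because nobody has yet recorded where to learn about its rules.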

In the beginning, some stakeholders are theoretical. A project developing a visionary product (as defined above) does not have specific customers to talk with. Instead, the initial task is to identify market segments that might have potential customers, and work out how to learn about those segments. A project might not have funding at the beginning. The first step is then to identify potential approaches to getting funding, and how to learn about each one. This might be anything from obtaining venture funding to getting small business innovation contracts (SBIR in the US) to getting budget from the hosting organization.

Some stakeholders are not obvious and not in the lists I have presented so far. For example, in many cases there are groups or individuals who have influence over a decision to approve funding for a project. They may not be named officially in a funder’s decision process but they have de facto influence. These people are also stakeholders and their needs must be considered in order to get funding approved.

33.4.2 Who is each stakeholder?

Getting to know a stakeholder and how they work is the second part of working out purpose. This is the time both to learn about who they are and to develop a working relationship with their team. Stakeholders are not all alike: each venture funding organization has its own culture, decision processes, and objectives. Each potential customer has a different objective and works in different ways, even if those differences are sometimes subtle.

A systems project aims to satisfy stakeholder needs. A project team works out how to communicate with the stakeholder in order to learn about their needs. They also learn how to communicate so they can make the case to the stakeholder how the needs are met and potentially get their approval. The team also communicates with the stakeholder when there are problems that affect meeting stakeholder needs.

There are several kinds of information that a team might want to learn about a stakeholder, and especially a customer.

This information can come from many different sources. Some of it may be spelled out by the stakeholder in a contract, a request for proposals, or in regulation. Some of it can be asked about. Much of the subtlety, however, has to be learned by observation and perhaps by talking informally with others who know the stakeholder.

The information about each stakeholder can be recorded in a prose document. Some organizations may have customer relationship management tools that will help capture some of the information about a customer.

RFP-driven projects. When deciding to respond to an RFP, the team must learn what acquisition rules the (potential) customer is using in order to determine what restrictions there are on how to communicate with the customer. In many competitive acquisition cases, the RFP must be the only official source that a team can use, so that all proposing teams work from the same information—thus treating all teams equally.

The team must also learn how the customer makes decisions, including who makes the decisions, who influences the decisions, and how the decision will be made. When responding to a commercial RFP, this can be easy: there is a contact who sends out the RFP and who can answer questions as needed, there is someone they work for who reviews and decides whether to accept a proposal or not, and the decision is based on what the decision-maker thinks meets their needs at the best price. For a US Government agency RFP, on the other hand, the decision process is defined by Federal Acquisition Regulations and by the agency’s supplemental rules. There are formal processes for submitting questions; there is typically a defined scoring and weighting system that a formal review team must use to rate each proposal.

When the customer is doing a competitive acquisition, the team also needs to gather information on the other teams that may be choosing to submit a proposal. This information helps shape the proposed design and the proposal itself to make them look better than the competition to the customer. This can include relative strengths and weaknesses of the other teams, such as whether this team has proprietary technology that will do a better job for the customer (a weakness of the other team), or whether the other team has more flexibility in pricing (which might be a strength of the other team). This information should be gathered into a competition document.

In practice RFPs are rarely complete or unambiguous. This is because they are written only by the customer, and there is little opportunity for dialog so that the customer can get alternative perspectives and check that their work is clear and complete. When it is possible, the team should engage in the kind of dialog with the customer that they would in a customer-driven project in order to confirm their understanding of what the RFP says and to flesh out the request to include a more complete picture of what the customer actually needs. When this is not possible, the team should find people who can accurately represent the customer’s way of thinking and needs, such as people who have a similar position in a different organization in the same industry, or someone who has worked closely with the customer in the past and knows the business or people involved.

33.4.3 What does a customer or stakeholder want?

The next step is to work out what each stakeholder wants or needs. I will present the rest of this section in terms of a customer, but this applies equally to any kind of stakeholder.

Some stakeholders have objectives to be met, such as a customer wanting a business function performed or a funder wanting a return on investment. Some stakeholders impose constraints, such as a regulator imposing safety rules or the organization limiting the amount of resource that can be allocated to developing the system.

Working with a stakeholder is full of pitfalls. The most serious is that the team will interpret what the stakeholder is saying in a way that the stakeholder does not actually mean. The team can bring their own interpretations to understanding what the stakeholder says; the result can be a system purpose or design that doesn’t meet the stakeholder’s actual needs.

The intent is to learn from the customer or other stakeholder. This is a time when the team must be careful not to be creative: they must exercise care to keep what they interpret or respond separate from what the customer is saying. The team’s interpretation comes later, and only once a solid basis of fact has been established.

The end result should be as complete and accurate an understanding of the customer’s needs as possible. Complete means that everything important about what the customer wants or needs has been included. Accurate means that the record contains only what the customer wants, and nothing else: no reading between the lines, no extrapolation, no filling in gaps. Knowing where the gaps are is important information that the team needs as they develop a concept and specification for the system.

Stakeholders speak in terms they are used to, and not the language of systems. Someone must be able to translate between the customer and the system-building team. A stakeholder should not be expected to do that; they are responsible for doing their work. Some stakeholders may have people on their team who have systems experience, and they can be a helpful part of the process, but no systems-building team should expect that the stakeholder’s team will have someone like that available. I have seen systems projects handle translation two ways: by having a few team members learn about a stakeholder’s business, or by bringing in people who already have experience in that business and systems-building.

There are many ways to elicit information from the stakeholder. I will discuss those later in this chapter.

Some of this information can be gathered from research materials. A customer that has issued an RFP should have documented most of their needs. Regulatory requirements can—theoretically—be understood by reading the appropriate regulations. A funder may have documented their expectations in a funding agreement.

The general process for learning stakeholder needs goes through three rough phases. In practice the process does not proceed smoothly from one phase to another; rather, it involves lots of looping back and moving forward on one topic before another is worked out.

The first general phase is to listen to what the stakeholder says they need or want. What problem do they want to solve? What capability do they want to gain? Why do they want to do this? The project team can prompt this discussion by asking a few open-ended questions: “Can you tell me about your business? What are the difficulties you are having or improvements you would like to make?” At the initial stage, before it is certain that the team understands the stakeholder, it is inappropriate to ask about specifics. The systems team must not presume that they know what the stakeholder wants until they have done the work to ensure that they do indeed understand the stakeholder.

During this phase, the team does little or no prompting. They should ask questions to clarify some point they aren’t sure they understand, and periodically read back what they think the stakeholder has said to confirm that they have heard it correctly. The point is to come to understand how the stakeholder sees their needs, including learning where the stakeholder—especially a customer—has not thought things through yet.

Throughout, the project team working with the stakeholder is recording what they hear. To the greatest degree possible, this record should be made using the stakeholder’s own language. This means that the project team may need to learn the language that the stakeholder uses and the assumptions they make, or include someone who can translate. The stakeholder should see the team making an effort to keep a record of what they learn; this conveys to the stakeholder that the team is taking them seriously. It also means that as team members cycle in and out of the project, the new people can learn about the stakeholder from the records and they do not have to ask the stakeholder to repeat over and over things that they have already said.

After the team has gained some understanding, it can be time to ask the stakeholder to expand on things they have said. An initial explanation often leaves gaps unexplored; this is the time for filling in gaps in what the stakeholder has reported and getting some depth of understanding. This is a time to prompt the stakeholder about assumptions they may not have articulated (and may not even normally think about). This is also a time to learn about industry norms that they operate within, which customers often take for granted and forget to explain to others.

Adding depth to the understanding includes things like who the major system users would be and what they would do with the system (informal use cases). It includes learning why a customer values those functions. It also includes learning about the context for a system: what environment it operates in and what other systems it will interact with.

Filling in gaps means finding functions or system aspects that the customer has not been thinking about. One may have some relevant experience that may suggest there are more topics to be checked. This is the time to ask questions like “I understand from what you’ve said that X and Y. Is there some Z that connects them?” Or “I’ve worked on some similar projects that have an issue with X. Does something like that apply to what you’re doing?”

Learning about how the customer expects the system to be used can expose some of those gaps. For example, if a customer expects that users will access a computer system, there might be a gap in how the users will log in, how they will be authenticated, and how their access might be controlled, all of which are security-related capabilities. The project team might prompt some of these discussions by asking an open-ended question like “what security risks do you see in user access?” Other gaps can be exposed when tracing through how some use case might work; the customer might mention that the system then sends a record of something to some other system, prompting a discussion of what that external interface is.

The third step gets into more detail, especially about unusual situations. Customers often focus on the good cases and don’t think about what happens when a user does something they shouldn’t, or when there is an emergency, or when something fails. They often don’t think about how the system might need to be evolved, and what responsibilities they may have when the system is retired. One person I talked to advised looking for all the “it will never happen” scenarios, and added that “the odds of them occurring are directly proportional to the fervor with which the user swears they won’t.”

The project team will likely prompt much of the discussion in this third step, because it is about filling in information that a customer does not think about much.

The process as a whole is gradual: it starts with only what the customer says, then moves to prompting them and asking more and more questions to get a fuller understanding of what they need. Questions late in the process will occasionally reveal major new functions or use cases that are important, even if they weren’t at the top of the customer’s attention. When these are discovered, the project team needs to go backward in the process to learn more about this newly-discovered aspect of the system, see how it changes their understanding of what has already been discussed, and eventually get into details about the new aspect and about how it affects other parts of the system.

At the end of the process, the team should have confidence that they understand the customer correctly. This requires validating their understanding with the customer. This should be done continuously: regularly asking questions to confirm understanding, for example. The team should also ask the customer to confirm the completed record to ensure that the customer agrees that it is complete.

Visionary projects. A visionary project, as I am using the term, is one where the system being designed and built is not for a specific, existing customer. Instead, the system might be marketed to several potential customers down the line, or the system might be part of a strategy to change an existing market or create a new one, thus creating new customers who may not even exist yet.

Consider building a new commercial passenger transport aircraft. The air transportation system is mature, and so one can name who buys these aircraft: airlines, aircraft leasing companies that provide the aircraft to airlines, businesses using aircraft for private transportation, and government organizations that fly passenger aircraft. No aircraft company in recent decades has built a new large passenger aircraft to be sold only to a single customer; instead, the companies work out the needs of many potential customers and design an aircraft that will be good for many of those customers. Since airlines come and go often relative to the lifetime of an aircraft design, many of the potential airline customers do not yet exist when the aircraft company has to decide on the capabilities of the new aircraft. This is a case where the market exists but there is not a single customer to satisfy.

In contrast, consider the first generation of global satellite data and telephony networks (such as Iridium and Globalstar). When they were being designed, there was no mass market of ground-to-space mobile communications. These companies, and others that did not end up deploying their networks, had to work out who their potential customers might be and what they might need. Indeed, all of these first-generation providers went bankrupt at some point as they developed their network systems and, at the same time, built up a subscriber base. This is an example of a project that was creating a new market.

In both these cases, there is not a single definition of a customer. Instead, the team must determine the market: the set of customers who might want the system. The team looks for the set of features or capabilities that will satisfy a large enough market to be worth supporting. The plan will often be to start with a small market segment and grow over time by adding capabilities to satisfy more people, while learning more about the first set of customers and gaining some revenue to help fund growth.

All this information will need to be collected from a number of sources, including market analysts, surveys of potential customers, and the experience of people who have worked in related industries. Finding people or organizations who can act as a proxy for a class of possible customers is helpful. It is important to gather from multiple sources in order to cross-check the information and to account for sampling bias that can happen if information comes from only one perspective.

The information about the target market segment(s) will change regularly over the course of the project as customers come and go, or as new opportunities appear. This means that the design and implementation of the system will likely need to adjust as time goes by. This also means that the team needs to continue to survey the market and talk to potential customers.

At the same time, it is a rare project that can successfully chase arbitrarily changing customer objectives. The design and implementation team needs enough stability that they can complete a version of the system. Marketing and sales teams need stability so that they know what they can actually sell to a customer. The stable version of the customer objectives should be baselined. Changes to the baseline should occur only periodically, when the team decides either that there is a change in the understanding about customers that is vital to reflect in the design of the system right away, even at the cost of delaying the system being ready for use, or that there is a change that does not delay or significantly change the system being designed and built right now.

The idea of a minimum viable product (MVP) has been fashionable in recent years. The general approach is to create the simplest system that will meet the needs of just a few customers, put the team’s focus on building up that first version, then plan on adding capabilities as time goes by to make the product attractive to more customers. This is an example of planning how to handle changes in understanding what customers want.

Visionary projects can expect that there will be competition with other teams’ products. Indeed, customer choice is a fundamental precept of the Western market system, and often required by regulation. A team should develop a record of what their competition might be, whether that is another organization offering a similar product (as happens with large passenger aircraft), or whether a customer could meet their needs a different way, or whether customers will choose not to buy a new product and live without its benefits (which is common with new technology trying to create a new market). The team should also build up an analysis of what sets this team’s system apart from alternatives: why a customer would choose this system over other options. Maintaining a competition document with this information will help the team make decisions about changes to the customer objectives or business objectives around which the team is designing the system.

33.4.4 Customer objectives

The customer objectives record what a customer (specific or hypothetical) wants out of the system. They serve as a proxy for the customer throughout system development, rather than having every developer talk directly to the customer to check their specification or design.

It is important to separate the customer objectives from other objectives. I have seen projects include business objectives, like profitability, in the list of “customer” objectives. I have also seen teams include internal technical objectives like being able to reuse parts of existing designs. Doing so creates confusion: is an entry in the list of customer objectives actually something the customer wants, or is it something that the customer doesn’t care about? There will come times in the development process when hard decisions must be made about some part of the system design. At those moments, the team must be clear about what is actually a customer need and what is an internal need. If a customer need can’t be met reasonably, the team needs to talk to the customer to resolve the issue. If an internal business or technical objective is proving hard to meet, the decision should be handled internally and the customer should not be involved—they don’t care or know about the issue.

For some customers, working in terms of use cases will be familiar. While documenting use cases is helpful, it cannot capture all of the information from the customer in their original language. Resist the temptation to document the objectives as formal use cases unless the customer is providing information that way. Formalization comes in the system concept, which derives from the system purpose.

33.4.5 Funder and organization business objectives

Funders (Section 16.2.4) and organizations (Section 16.2.3) provide the resources for a project. Each of them has reasons for being willing to invest in the project, and so the project must in return address those reasons.

A funder provides resources to the organization that runs the project. The funder then expects some kind of return on this investment. That return can be financial or intangible, but they will only invest if they have confidence that they will obtain the expected return. The confidence in turn comes from the plans and strategies that the organization provides to make the case for investment.

An organization that is devoting resources to build a system must be able to obtain those resources. At the minimum, the organization must be able to hire and pay the people who design and build the system; it must be able to pay for the tools and prototypes it uses; it must be able to pay people to gather customer objectives and work with regulators and all the hundred other tasks involved.

Most organizations are also building an ongoing business, not just coming together long enough to build one system and then disbanding. Sustaining a business requires obtaining funding, getting sufficient return on the work the organization does in order to fund continuing work, and building capabilities that allow the organization to keep building or maintaining systems into the future.

All these imply that an organization needs to have a business strategy, which leads to business objectives. The organization may have a strategy of developing a product line that serves a wide variety of customers. This might translate into an objective to build a simple initial system product that is able to generate X revenue, and that can be extended over time to address the needs of more customers.

Many organizations develop these objectives at the executive level but do not feed the information downward explicitly to the team who must design a system. This is a problem because the design team knows that such objectives exist but doesn’t necessarily know exactly what they are, and thus can’t make accurate design judgments. We have seen, over and over, questions in a design team like “should we design this board with extra capability now, or design the minimal board and replace it later?” These have often led to arguments because the design team did not have the information needed to choose between a higher up-front investment cost for extra capability and a later cost for a redesigned board.

These business objectives change continuously. When there is a proposal to change the objectives, the team must follow a disciplined process to determine what the effects of the change might be. This involves tracking down how the change will affect technical requirements and designs, which in turn determines whether the system can still satisfy customer, regulatory, safety, or security needs. Changes to the design will also affect development cost and the time required to bring the system to operation. Sometimes a change to business objectives will make sense: changing the rate at which the system should scale up after the initial operational version may not affect the development time much but will increase customer satisfaction. Other times a change will have negative consequences: setting the goals for the size of the addressable market too high too early may require a higher development budget and longer development time than is available. Making a well-informed decision about these changes is only possible if the team can determine what the effects of a potential change in business objectives are.
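
The tracing step described above can be sketched as a walk over a traceability graph. The following is a minimal illustration in Python, assuming the project records which artifacts are derived from which; the `TRACE` table, the artifact identifiers, and the `affected_by` helper are all hypothetical and are not drawn from any particular tool.

```python
from collections import deque

# Hypothetical traceability links: each artifact maps to the artifacts
# derived from it (business objective -> requirements -> design elements).
TRACE = {
    "BO-1 initial product revenue": ["REQ-10 unit cost ceiling", "REQ-11 launch date"],
    "REQ-10 unit cost ceiling": ["DES-3 minimal controller board"],
    "REQ-11 launch date": ["DES-3 minimal controller board", "DES-7 certification plan"],
    "DES-3 minimal controller board": [],
    "DES-7 certification plan": [],
}


def affected_by(changed, trace):
    """Return every artifact reachable from `changed`, in breadth-first order."""
    seen = []
    queue = deque(trace.get(changed, []))
    while queue:
        item = queue.popleft()
        if item in seen:
            continue  # already recorded through another path
        seen.append(item)
        queue.extend(trace.get(item, []))
    return seen


impacted = affected_by("BO-1 initial product revenue", TRACE)
for artifact in impacted:
    print(artifact)
```

Starting from the changed business objective, the walk lists the requirements and design elements that would need review. A real project would then evaluate each one for its effect on cost, schedule, and the other stakeholder needs.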

33.4.6 Regulatory objectives

Many kinds of systems are subject to regulation. Some systems require licensing or certification, to prove that they meet regulations; others only need to be able to show compliance on demand.

These regulations pose constraints on the design of the system. Some are simple: aircraft emergency exits must be marked in particular locations. Some are complex: the crew of an aircraft must be able to properly determine what is happening with an aircraft even when there are complex failure situations—which involves human factors as well as the design of aircraft sensing systems.

A system will not be able to be put into operation unless it can meet these regulations. This means that the regulations must be incorporated into the design, just as the functional desires of the customer must be. One cannot do this unless one knows what the regulatory constraints are, and so one must search out and document all the regulations that apply.

The regulatory objectives documentation should at minimum list the source regulatory documents that apply to the system. Before design validation is complete, the information in the regulatory objectives document must translate into a detailed collection of requirements against which the system can be checked.

It is often necessary to involve either experts in the regulation of a particular industry or the regulatory agencies themselves to properly gather all of the regulations that apply.

Aircraft regulation. Aircraft regulation is focused on managing the risk to aviation non-participants (such as people on the ground) or casual participants (passengers on board an aircraft). The body of regulation is complex, taking a number of different approaches to protect people in general while allowing those who can take responsibility for aircraft behavior the maximum feasible freedom to do as they need. This results in a combination of rules: licensing of aircraft types, constraints on where different kinds of aircraft can be flown, pilot training and certification, air traffic control over where aircraft are flying, and many others. It requires the combination of all of these rules to meet the objective of controlling risk to the public.

The regulations that apply to aircraft in particular (as opposed to the larger aviation system) begin with classifying the kind of aircraft by the risk it poses. Ultralight aircraft are lightly regulated, and are primarily defined by a low maximum weight, speed, stall speed, and so on. Pilots either do not need a license or only need a limited license for ultralight aircraft. They generally can only be flown in daylight. There are intermediate kinds including those for general aviation, aerobatic and utility aircraft, commuter aircraft, and finally transport aircraft. Each category has limitations on its weight, speeds, number of passengers, acceptable pilot qualifications, and allowed maneuvers. The restrictions increase as the number of passengers, weight, and speed increase because each of these induces greater risk to the public.

CAAs throughout the world have encoded the regulations for each category of aircraft. In the US, for example, the regulations for transport aircraft (the largest category) are defined in the Code of Federal Regulations, Title 14 (the FAA), Part 25 (Transport category airplanes). Other parts of Title 14 cover topics like airports, the structure of airspace, air traffic control, carriers or operators, and navigation facilities; these other parts define the environment in which the aircraft will operate.

Most kinds of aircraft require a type certification. This is issued by a CAA to show that the CAA has verified that the aircraft’s design meets all these regulations. This is the first enforcement mechanism used to ensure that an aircraft complies with regulations. There are additional mechanisms, including registering individual aircraft and periodic inspection of the aircraft and its records by CAA-authorized auditors. The final level of enforcement comes from air traffic control granting permission to fly or not.

There are some regulations that apply to aircraft that are not typically handled by a CAA. This includes radio communication, which is typically regulated by a national communications authority (in the US, the Federal Communications Commission) and harmonized worldwide through the International Telecommunication Union.

Spacecraft regulation. Unlike aircraft, spacecraft do not have a unified regulatory regime. This is in part because there is no single unifying principle behind the regulations, as there is for aviation (safety of the public). Most spacecraft pose a negligible danger to the public during operation, as they are small enough to be destroyed when they re-enter the atmosphere. Historically, there has been concern about the military value of the information produced by spacecraft; more recently, there is increasing concern about the dangers one spacecraft poses to other spacecraft.

At the time of writing, in the US, spacecraft regulation includes:

Licensing for radio communication;
Licensing for performing remote sensing, such as imaging of the Earth;
Licensing for launch or entry into the atmosphere, to ensure there is no conflict between the vehicle and aircraft along the path;
Licensing for launch, to ensure safety of the public;
Orbital debris analysis and mitigation; and
Export control of certain technologies.

These regulations are spread over multiple agencies, and are changing rapidly as commercial uses of space change.

33.4.7 Third party objectives

No systems operate in isolation. Instead, they operate within the context of a larger system of people, businesses, and organizations. This might include:

These systems and organizations are stakeholders, whose needs or objectives must be understood and addressed in the system.

The interactions and dependencies within this larger system create constraints on how the system being designed must function. It is important to identify each of these third party stakeholders, document how the system will interact with them, and then document the more specific objectives that are involved in working with them.

This information should be collected into one or more documents that record, first, the structure of the larger system and its interfaces with the system being designed; and second, the sources of constraints or objectives for each interface.

Information about the ecosystem in which the system will operate is likely to change frequently over the course of developing a system, especially for visionary projects. This means that it is important to update information about these objectives, and when it changes, flow those changes down into the system design.

Example: communication services. Consider a system of multiple vehicles—such as cars, trucks, or small UAVs—that need to communicate continuously with a central operations facility. The system itself is the vehicles and the operations facility. The communications are likely to be provided by a third party: a cellular communications company, for example.

As the system design progresses, the team will be able to define more and more accurately what capabilities are needed from the communication system. How reliable does it need to be? Can there be areas with poor or no coverage? What data rates are needed?

At the same time, communication providers will have their own constraints and capabilities. This might include pricing—both how pricing is calculated (Flat rate? Amount per data transferred?) and what the rates are. It might include their coverage area, and their mechanisms to provide information about outages or new coverage. It might include terms of use, with restrictions on what kind of data can be transmitted and what security measures the system must take in order to be connected to the provider’s network.

Example: spacecraft launch provider. Most spacecraft launches are performed by a company different from the organization that builds and operates the spacecraft. The launch service provider is responsible for receiving the spacecraft from its builder, integrating it onto the launch vehicle, and placing the spacecraft in a designated orbit. The launch provider is in turn responsible to regulatory agencies that ensure that the launch operations are safe, and in many sites the launch provider must work with a range safety organization (in the US, the US Space Force provides range safety for the Eastern and Western Test Ranges).

There are two classes of interactions between the launch vehicle and the spacecraft: the effects that the spacecraft can have on the launch vehicle, and the effects that the launch vehicle can have on the spacecraft. The provider gives the spacecraft designers specifications of the launch vehicle, including how the spacecraft will be attached and released; what vibration, pressure, and thermal environment the spacecraft will be in during processing and launch; and what communication is possible between the launch vehicle and the spacecraft. The provider also gives constraints on what the spacecraft can do, such as constraints on the spacecraft’s mass, volume, center of gravity, or gas releases. The provider also gives safety constraints, such as the allowed propellants or toxic materials, the state of batteries or other energy storage systems, or the permitted electromagnetic radiation. These constraints usually derive in part from the launch provider’s safety certification with the appropriate regulators or range safety organizations.

Most launch providers make a Payload User’s Guide available that documents this information.

Example: safety-critical component provider. A recent project I worked on involved acquiring a number of sensors for measuring the environment around a vehicle, so that the vehicle could safely plan a path around obstacles. Some of the sensors were not yet available in production, and the team had to work with the providers to obtain evaluation units.

The interaction between the team and the sensor provider was typical of interactions with providers in general. Negotiations between the team and the provider covered topics like:

Minimum numbers of units to be purchased once the component went into production;
How much non-recurring engineering costs (that is, custom development costs) each party would cover;
Limits on what the sensors could be used for, in order to manage competition between the team and other companies purchasing the component;
Exclusivity agreements on custom features or component usage;
Quality control standards; and
Processes for delivery, acceptance testing, or rejecting a defective unit.

These issues do not affect the core technical function of the component. Some of them do, however, place constraints on how the team can use the component (it might not be possible to repurpose the sensor for any arbitrary function). Other issues, such as quality control or acceptance testing processes, affect the safety of the system that incorporates that component.

As a result, these constraints also need to be captured in the system purpose, and the system’s design must be validated against these terms.

33.4.8 Safety objectives

Safety is the condition that a system, when operated in the intended way, does not produce too many events that cause harm. There are four parts to this statement:

In the end, a system must be shown to be safe by showing that the rate at which it causes harm is below a threshold. The process of designing a system to be safe is well known to be a difficult task, and there are many books and standards that try to give guidance on how to do so. As the system is designed, it must be evaluated to show the likely rates at which harmful events will occur.

This is a complex topic, and later chapters address the design and analysis of safe systems.

The ways in which a system needs to be safe derive from customer, organization, and regulatory needs. Safety is not a stakeholder itself; rather, safety needs come from those stakeholders. A customer may care about harms to its workers or property; a regulator may care about environmental and social harms. An organization may care about its reputation for building safe systems. The definitions of what makes a system safe or not, thus, come out of stakeholder needs.

Defining safety objectives is the first step in designing a safe system. This involves defining what kinds of harms are to be measured, along with the acceptable rates at which they occur. There is no possible way to justify that a system is “safe” or “unsafe” without defining the harms they refer to. Safety objectives, as part of the system purpose, inform the development of the system concept and later its specification and design.

A harm is some undesired outcome of a system’s operation. A well-designed system avoids or minimizes these outcomes. There are many synonyms for this: a loss, an injury, a hurt, or damage.

There are many different kinds of harms, and each system will have its own set that matter. Examples include:

Each project must define and document its own high-level safety objectives in terms of the harms and the acceptable rates of those harms occurring.

Some industries have conventional definitions of harm and rates. The automotive industry has adopted a scale of zero to three for “severity” in the ISO 26262 standard [ISO26262], focused entirely on injury to persons. Severity 0 is no injuries, 1 is light to moderate injuries, 2 is severe injuries with survival probable, and 3 is severe or fatal injuries. The aviation industry has defined a five-level scheme in the ARP 4754 standard [ARP4754], ranging from minor (slight increase in crew workload or minor passenger inconvenience) through hazardous (serious or fatal injuries among passengers) and catastrophic (many deaths, loss of aircraft).

These two standards differ in two respects. They consider different ranges of harm: ISO 26262 has any severe or fatal injury as its highest category, while ARP 4754 considers the distinction between fatal injury and mass fatality. They also consider different kinds of harms: ISO 26262 only considers injury to persons, while ARP 4754 considers effects on the crew’s ability to control the aircraft and damage to the aircraft.

These point to deficiencies in the standards, and to the reason why a project should define its own safety objectives carefully. There are many harmful incidents that these standards do not address, such as damage to property. Consider an incident involving a truck that damages an overpass, but does not injure anyone directly. The cost of repairing or replacing the bridge can run to several millions of dollars; the economic impact on the community of not being able to use the bridge can be equally high. In addition, depending on the industry, the range of severity in these standards can also be too limited: they do not account for harms that spread beyond the people and vehicles immediately involved in an incident. The use of aircraft as missiles in the 9/11 attacks showed how an aircraft safety incident can result in mass casualties or worse.

In addition to defining the harms that system design will consider, the safety objectives set targets for how often those harms can occur. Sometimes a potential harm can be designed out, so that there is no chance of it happening. Most of the time harms can only be minimized, not eliminated: an aircraft can only avoid falling if it never flies, and a boat can only avoid sinking if it is never put in the water.

Instead, the safety objectives will need to define how often each harm is allowed to occur. This is typically defined as some maximum amount per unit of time or activity.

Guidance issued for commuter aircraft [FAA11], for example, gives a maximum allowed rate of incidents per flight hour for commuter-class aircraft:

Some organizations may choose to say that the system they build should allow zero safety incidents above a certain level. This is possible only if the system can be guaranteed never to perform operations that could induce such serious events. For example, an aircraft can be guaranteed never to cause catastrophic harm, involving multiple fatalities—but only if the aircraft has a maximum weight of a few tens of kilograms, a low maximum speed before it disintegrates in the air, can only carry a single person, and so on. No transport aircraft (more than 19 seats or maximum takeoff weight greater than 19,000 lbs) that actually flies can ever have a zero rate of catastrophic harm, if only because of its kinetic energy while flying. Similarly, many weapons systems can never have a zero rate of mass casualty harms simply because of the energy they carry. In most cases, as the conventional wisdom goes, the only way to get a system to have a zero rate of harm is not to build the system.

An engineering team needs defined and measurable harms and rates in order to design and build a safe system. The team will often have to make trade-offs between one safety objective and another, or between safety and something else. They will need to judge how much effort to put into meeting some safety aspect of the system, and prioritize effort among multiple possible tasks. However the objectives are defined, the team must be able to determine whether they have reached a solution that is good enough to meet the objectives. They must be able to agree on what is needed, and build component behaviors or structures that, when integrated with others, result in a safe system.

Defining precise safety objectives early in a project is required for building a safe system. I have observed many projects that made aspirational statements about “safety being a first priority”. In every single instance where the definition stopped at that statement, the team designed an obviously unsafe system—often because, in the absence of an objective standard, each person took steps they thought would be safe but in aggregate the design missed even basic scenarios that resulted in hazards. Further, the absence of an objective meant that no one could perform an objective analysis of a design to determine whether it was good enough.

Maximum allowed incident rates per flight hour for commuter-class aircraft [FAA11]:

Category	Harms	Maximum rate (per flight hour)
Minor	Physical discomfort; use of emergency procedures	$10^{-3}$
Major	Physical distress or injuries	$10^{-5}$
Hazardous	Serious or fatal injuries	$10^{-7}$
Catastrophic	Hull loss; multiple fatalities	$10^{-9}$
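
To show how rate targets like these can be used, the sketch below checks predicted incident rates against the maximum rates in the table. It is a minimal illustration in Python: the `MAX_RATE` values come from the table above, while the predicted rates, the fleet exposure figure, and the helper names are invented for the example and assume a constant rate over time.

```python
# Maximum allowed rate per flight hour for each harm category,
# taken from the commuter-aircraft table above.
MAX_RATE = {
    "Minor": 1e-3,
    "Major": 1e-5,
    "Hazardous": 1e-7,
    "Catastrophic": 1e-9,
}

# Hypothetical predicted rates from a safety analysis (illustrative only).
predicted = {
    "Minor": 4e-4,
    "Major": 2e-5,
    "Hazardous": 5e-8,
    "Catastrophic": 8e-10,
}


def unmet_objectives(predicted_rates, max_rates):
    """Return the categories whose predicted rate exceeds the allowed maximum."""
    return [
        category
        for category, limit in max_rates.items()
        if predicted_rates[category] > limit
    ]


def expected_events(rate_per_hour, flight_hours):
    """Expected number of events over an exposure, assuming a constant rate."""
    return rate_per_hour * flight_hours


failures = unmet_objectives(predicted, MAX_RATE)
print("Objectives not met:", failures)

# Hypothetical exposure: 50 aircraft each flying 2,000 hours per year.
fleet_hours = 50 * 2000
print("Expected major events per year at the limit:",
      expected_events(MAX_RATE["Major"], fleet_hours))
```

With these made-up predictions, only the Major category exceeds its limit, which tells the team where further design effort is needed. The same comparison gives an objective answer to whether a design is good enough against the stated objectives.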

33.4.9 Security objectives

Security objectives are closely related to safety objectives: both are concerned with naming harms that the system should avoid, and both derive from other stakeholder needs.

Security objectives differ from safety objectives in three ways. First, several kinds of harm are typically considered “security” concerns rather than “safety”, such as disclosure of confidential information or unauthorized system use. Second, security typically deals with harms that are not well characterized by rates of occurrence: events that are so catastrophic or unrecoverable that they should not occur even once. Finally, security often deals with harms that can be caused by malicious, intentional actions.[1] Security objectives are generally not characterized by rates of harm because the hazards do not occur regularly or randomly; they occur when a malicious actor creates a hazardous situation.

As with safety objectives, security objectives are defined in terms of harms to be avoided. These harms generally include all the harms identified in the safety objectives, as well as things like information disclosure, interruption of business, financial loss, or theft of goods.

Unlike with safety objectives, the security objectives also include the list of threat actors. These are the people, organizations, or systems that can choose to initiate an attack on the system. Each threat actor will have their own motivation: a criminal organization for financial gain, a nation state to disable defense-relevant capabilities. The likelihood of some harm happening, the amount of resources to apply to avoiding that harm, and the ways to avoid the harm depend on the actor and their motivations or incentives for causing a problem.

The system must then be designed to address the different harms that different threat actors might pose. The resulting design can be analyzed to determine whether the threats are sufficiently addressed. The built system can be tested to verify that key defensive features are working as intended.

The definition of “sufficiently addressed” remains subjective. Some security analysis techniques have rationales for assigning weights to different threats. For those analyses, ensuring that all high- and medium-priority threats have been mitigated might be sufficient.
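
One way to make “sufficiently addressed” checkable is to record each threat with its actor, harm, priority, and mitigation status, and then apply the rule described above. The sketch below is a minimal illustration in Python; the `Threat` record, the three-level priority scale, and the example entries are all assumptions made for illustration rather than part of any particular analysis technique.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Threat:
    actor: str        # who could initiate the attack
    harm: str         # harm named in the security objectives
    priority: str     # "high", "medium", or "low" (hypothetical scale)
    mitigated: bool   # whether the design addresses this threat


# Hypothetical entries for illustration only.
threats = [
    Threat("criminal organization", "financial loss", "high", True),
    Threat("criminal organization", "theft of goods", "medium", False),
    Threat("nation state", "interruption of business", "high", True),
    Threat("curious insider", "information disclosure", "low", False),
]


def open_threats(items, must_mitigate=("high", "medium")):
    """Return threats at a priority that must be mitigated but are not yet."""
    return [t for t in items if t.priority in must_mitigate and not t.mitigated]


remaining = open_threats(threats)
sufficiently_addressed = not remaining

for t in remaining:
    print(f"Unmitigated {t.priority} threat: {t.actor} -> {t.harm}")
print("Sufficiently addressed:", sufficiently_addressed)
```

In this example one medium-priority threat is still open, so the design would not yet count as sufficient under that rule. The unmitigated low-priority threat does not affect the result.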

There are many standards related to security, and depending on the industry and geographic region compliance with some standards may be mandatory. These may define security objectives that a system must meet for regulatory or business acceptance. This information should be documented in the regulatory objectives, and information about threat actors or harms should flow from the regulatory objectives into the security objectives.

33.4.10 Synthesis

Previous steps investigated what each stakeholder needs individually. In the end, the project must satisfy all of the stakeholders.

The next step, then, is to create a synthesis of all of the stakeholder needs. In simple cases, this is just merging the different needs together: the customer needs X and Y, the funder needs Z, and so on. The difficult cases are when there might be conflicts between different stakeholder needs.

Consider two examples. In one case, the customer wants a product within 24 months at a given price. However, the product will require regulatory approval, and the approval process takes at minimum 12 months if nothing goes wrong. These two stakeholder requirements could conflict if the approvals aren’t completed within 24 months or if the cost of getting approval becomes too expensive. In another case, the funder requires a minimum return on investment to fund other projects in the future. The customer is price-sensitive and is only willing to pay up to some maximum amount. These could conflict if the cost of the project’s development and deployment is high enough that it doesn’t leave enough profit for the funder.

Note that neither of these examples is a certain conflict. They represent serious risks to project success, but they do not indicate that the project cannot succeed. These potential conflicts are added to the list of risks for the project (Chapter 66). These will need to be analyzed during concept development (Chapter 34).

Sometimes there will be clear conflicts between stakeholder needs. A customer may want a capability that is not allowed by regulators. An organization may want to impose constraints that mean customer needs cannot be met (e.g. Sections 4.1 and 4.4). When the team finds these kinds of conflicts, they can either negotiate with the various stakeholders in hopes that one or more of them will relax their needs, or the team can determine that the project is not feasible and stop the work. The decision to stop the project is part of the stakeholder review and agreement milestone (Section 33.9). The worst outcome is not to find out about conflicts until lots of time and resources have been spent.

33.4.11 Results

The outcomes of investigating a system’s purpose are records of what each stakeholder needs, along with checks of whether those results are complete and consistent. Some form of these records will be checked by stakeholders as part of validation (Section 33.8) and a review of purpose (Section 33.9).

Again, it is important to limit the exercise to determining needs and purpose. The work should not expand into developing a concept or specifications—yet.

I have found it useful to maintain two separate kinds of records. The first is an unabridged, untranslated record of what each stakeholder has said—regardless of source. The second is a purpose statement for the stakeholder, an organized form of their needs. The information in the organized form should be specific and actionable, and measurable where possible. These records should use the stakeholder’s own language and frame of reference.

I have structured the record of a stakeholder’s needs in different ways. It should be simple. A bulleted outline of problems to solve, functions needed, and needed attributes like reliability or security can work, along with a few diagrams to explain them.

The stakeholder will review and validate the purpose statement. This is the basis for them approving the team’s understanding of purpose or not.

The project team will develop other information along the way. This consists of things like use cases, definitions of broad functional needs or partial potential system concepts, and records of gaps or inconsistencies in the stakeholder needs. All this information is developed by the project team for their use; it is derived from what stakeholders say, but it is not a record of the stakeholder needs. (The team must take care never to confuse this interpretation of stakeholder needs with the actual stakeholder needs.) This information can be in any form useful to the project.

33.4.12 Techniques and tools

There are several tools that a project team can use to help the process of learning what stakeholders want. I often expect to spend many hours or days discussing with a customer to understand their needs, at least for complex systems. Other stakeholders may not require as much time.

Some stakeholders may not be able or allowed to discuss interactively and one must deal with written information. I have seen this most often with government stakeholders. When a public entity is requesting competitive proposals, they are often limited to interacting in writing and to sharing any questions and feedback with all potential proposers. Regulators are obliged to follow the text of their regulations, and in trying to explain them they can potentially convey wrong impressions. They often lack the people and time to answer many questions anyway.

When discussing with a stakeholder, the first tool is to keep a record of what they say. This can be in notes, audio recording, or video depending on the circumstances. (I find that notes are useful in addition to any recording.) It should be very clear to the customer that the team is making a record; this conveys a message that the team is paying attention. However, the team should check with the stakeholder about how they will build a record to address any confidentiality concerns that they might have.

Active listening is a set of techniques that help the team understand what a stakeholder needs. There are many resources available to explain it in detail. The purpose of the techniques is to put the focus entirely on what the stakeholder knows, to listen with the intent of understanding them, and to convey implicitly that one cares about what they know more than about finding a solution at the start.

At the beginning of learning about a stakeholder, the listening is focused on hearing from them. As the process goes on, project team members will begin to ask more questions and gradually direct discussions to specific topics, filling in gaps in understanding. Even as the team asks more questions, though, they must keep the focus on learning from the stakeholder, and not put words in the stakeholder’s mouth.

Asking open-ended questions is another useful tool. Asking a question that indicates interest in a subject, without giving potential answers, allows for the possibility of learning something unexpected. If the stakeholder has need B, and a team member asks a question like “for topic X, are you thinking of A?”, then the stakeholder might just answer “no” and the team will not learn of B. Or worse, the question may lead the stakeholder to start thinking in terms of A when they really did mean B. It is better to ask a question like “what are you thinking about for topic X?” and let them answer as they will. Only ask “are you thinking about A” when you are confirming that they have said “A”.

Getting information from multiple people within a stakeholder’s organization has two benefits. It can provide different perspectives on some need, providing more information about needs. One person might consider something more important than another person does; the first person will likely emphasize that need more than the other person. The other benefit is that it can expose differences of opinion or inconsistencies within the stakeholder. These differences can cause problems for the team down the line, when a stakeholder must approve part of the system. The team can record these disagreements and work to ensure they are resolved before they become a problem.

Shadowing part of a stakeholder’s operation can be a productive way to understand operational needs. (It is not particularly useful, for example, with a funder.) By following along as a customer’s staff goes about their business, the project team gets a different perspective than they would talking about things in a meeting room. They will see little details that the customer staff might not think about because they take those details for granted. Early in my career, I was asked to work out how to automate some city government finance operations. Processing utility bill payments was one of those functions. It was not until I sat by the person who received envelopes from the post, opened and processed each one, and recorded the necessary information that I actually understood the manual steps involved and the importance of the physical environment—desk, shelves, sorting baskets—that they relied on to do the job. The result was to conclude that further automation was not useful for that task at the time, so I could put my energy into other functions.

I have found that acting out scenarios is a powerful way to engage stakeholder imaginations about potentially complex operational needs in a system. In discussing UAV traffic management with a government organization, we were having difficulty discussing malicious behaviors that UAV operators could exhibit. When I then tasked two of the people in the meeting to act out good and bad scenarios, they immediately understood the problem I was trying to raise. After that we had a productive discussion about the problems that a traffic management system might need to address.

Behind the scenes, the project team can build models of what the stakeholders are saying in order to organize the information they have found. Definitions of system users, use case models, and activity models record who will interact with the system, and for what. Definitions of interactions between a system and the outside world record information that will inform system scope. Lists of functional needs, perhaps organized into groups or hierarchies, help in finding topics that the stakeholder has not addressed or places where there are contradictions. All these models derive from the raw information received from the stakeholder, and the team must ensure that the models only contain information that has been expressed by a stakeholder.

These models are for the project team’s use, and generally not for the stakeholder to see. The models will be expressed using language or notations that the team understands, but the stakeholder likely does not. The team uses these models to prompt confirmation questions, find gaps, and identify contradictions. These models likely include information from multiple stakeholders, and some of that information should not be shared between stakeholders.

Most important, the models are not the reality of stakeholder needs; they are an interpretation. The models are hypotheses about the stakeholders, and as hypotheses they must be investigated to confirm or deny their accuracy.

It is possible and often tempting to put too much energy into developing such models. While discovering stakeholder needs, all this work is tentative; there is some probability that there will be a surprise coming along soon that will invalidate some of the hypotheses encoded in the models. The team should put only enough effort into these analyses to further their understanding of the stakeholder. These models will be improved during the next phase, when the team works on system concepts. The effort involved in elaborating use cases and finding solutions should be deferred until concept development and later in design.

Because the intent of working out a system’s purpose is to have an accurate and complete understanding of stakeholder needs, one takes deliberate steps to check the accuracy of what one thinks one has learned—that is, to validate the understanding.

33.4.13 The project team

The team members who work on discovering project purpose need a few different skills.

They will be interacting with the stakeholders. The team members need social skills to build rapport with the stakeholders. They need skills in communicating and listening.

The project team translates between a stakeholder’s point of view and language and the team’s own language. The team needs someone who can understand both groups’ ways of expressing information. This can come from team members learning the stakeholder’s language, or from ensuring the team has someone with experience in both. Sometimes bringing in a consultant meets the need.

The team also documents its understanding of the stakeholder needs. This involves skills of organizing the information in ways useful to the team. I have found it helpful for the team working with a stakeholder to include people with engineering experience as well as people with marketing experience. (I discussed one example in Section 4.2.)

Finally, the team produces documents of its results. The team should include someone who can write text or presentations that are concise and clear.

For complex systems, all these skills are rarely found in one person. It is often more effective to assemble a small group—perhaps two to four team members—to work with the stakeholder.

33.5 Competition

Virtually every systems project will be in some kind of competition—whether for a customer contract, for sales of a developed system, or for acceptance of a new technology over an existing approach. A team can develop a good concept or a good system, but then fail to get that system used.

Knowing about competition applies to every project, not just those which must generate a competitive proposal. A customer-driven project must still satisfy its customer; the customer will be aware that they have choices about what investments to make in new systems or upgrades. A visionary project may have direct competitors who may try to build similar systems—but visionary projects also have competition from the way problems are already being solved, as a customer can always choose not to buy any new system and stick with what they already have.

Understanding competition provides constraints on the system and offers ideas about possible objectives. Does a competing system have some special value to customers? It won’t be enough for this system to match the other one; the system will need to do better or offer some other value to the customer. Does the competitor have a weakness in their offering? This system should have a better solution in that area. Is there something that this system can have that will help it keep an advantage over time? These are all questions that business strategy and marketing teams investigate regularly.

The team should gather this kind of information into a competition document. That document gathers together intelligence about who and what might compete with this team’s system. It lists strengths and weaknesses of each competitor. The competition information informs working out the system concept in the next steps (Chapter 34).

The competition document must be an unbiased presentation of the alternatives to using the system being designed, and of the advantages and disadvantages of those alternatives.

Many people will naturally want to emphasize what they see as their own strengths and try to contrast the competition with those strengths. That makes for a misleading competition document.

The competition must be presented as fairly as possible, and from the customer’s point of view. The document must be honest about the strengths that competitors have: they will have strengths and the team cannot defend against them if they do not have an accurate assessment of them. The document must be equally honest about the competitors’ weaknesses. The team cannot design a better solution if they do not accurately know what customers don’t like about what their competition offers (or might offer), or if they don’t understand what structural problems the competition might have in designing or building their own offering.

The competition document is never really complete, because other teams and other technologies will always be changing. The competition should be revisited periodically as the project continues, to find when the competition has changed and to determine when those changes need to lead to changes in the system. These changes amount to a change of system purpose, and lead to changes in the system concept, then specification, and onward. Section 33.10 discusses this process.

33.6 Defining scope

A system’s scope (Chapter 10) defines the boundaries of the system. The boundary separates two things:

The project cannot affect what is outside the system. They take that as given and work to accommodate it. This includes the environment in which the system will operate; the users who sit outside the system and use its functions; and rules that the system must follow. Understanding this environment is an important part of understanding system purpose.

The project builds the system to handle everything that is inside the scope boundary. That includes passive elements like mechanical structures; active machinery like computing or mechanical systems; and people who work within the system.

The project team has some control over the scope, especially at the beginning of a project. The stakeholders’ desired purpose is sometimes more complex than what can be developed feasibly in a reasonable time. The project team can address this in three ways: by negotiating a smaller scope for the system; by negotiating a phased delivery of system versions with increasing functionality; or by determining that the team should not try to build this system for these stakeholders.

Negotiating about scope involves ranking needs to find a subset that is essential and that will still satisfy stakeholders enough. The ranking includes identifying what needs are interrelated and must be satisfied together or not. This may not be possible to complete until the project has developed some of the system concept, as discussed in the next chapter (Section 34.10.3). Similarly, if the project proposes delivering a series of system versions with increasing scope, it may be necessary to get part way into concept development in order to understand what would make sensible versions.

If the project team expects that the system’s scope will need negotiation, and that the scope changes aren’t clear just from looking at purpose, the team should document the concerns, ensure that stakeholders know there is a potential concern, and include scope negotiation on the list of risks that will have to be addressed after the purpose is baselined initially.

33.7 Artifacts

The organized lists of stakeholder needs, the list of risks, and internal analyses must clearly derive from what the stakeholders have said. Ideally, anyone looking at this derived information can trace that to specific statements from the stakeholders on a given date.
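
One way to make this traceability concrete is to have each derived item carry references to the stakeholder statements it came from. The following Python sketch is illustrative only: the class names, fields, and example statement are hypothetical, and a real project would more likely hold these links in its requirements or document management tool.

```python
from dataclasses import dataclass, field
from datetime import date


@dataclass(frozen=True)
class Statement:
    """A verbatim statement from a stakeholder (the unabridged record)."""
    stakeholder: str
    said_on: date
    text: str


@dataclass
class DerivedItem:
    """An organized need, risk, or analysis item produced by the team."""
    summary: str
    sources: list = field(default_factory=list)  # Statements it derives from


def untraceable(items):
    """Return derived items that cannot be traced to any stakeholder statement."""
    return [item for item in items if not item.sources]


# Hypothetical example data; the date and wording are invented for illustration.
s1 = Statement("customer", date(2024, 3, 5), "We need delivery within 24 months.")
items = [
    DerivedItem("Deliver within 24 months", [s1]),
    DerivedItem("Support offline operation"),  # no recorded source
]
```

Running `untraceable(items)` returns only the “Support offline operation” item, which flags it as something the team introduced without a recorded stakeholder statement behind it.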

This information is maintained under configuration management (Sections 17.4 and 23.7). It is in progress while being developed, and is baselined after the purpose review (Section 33.9). The team will develop new versions as they learn additional details about stakeholder needs or as those needs change (Section 33.10).

33.8 Validating purpose

The final system purpose should reflect each stakeholder’s needs accurately. Validation is the process of checking that the purpose is indeed accurate.

Validation is not a single task. It is a continuous process. Small validation steps occur all the time while the project team is learning what a stakeholder needs.

The project team validates information in order to avoid wasting effort or time on something that will be rejected in the end. That is, validation is in part an exercise in risk management. The sooner significant misunderstandings can be caught, the less time will be wasted. The active listening techniques discussed above for listening and confirming understanding during discussions provide quick feedback.

I have found it useful to have mini-reviews along the way to a final review. In these, the team takes one or two topics, summarizes their understanding back to the stakeholder, and validates the work in progress. This is useful when the project team has been assembling stakeholder information into organized forms, so that the stakeholder can check both the way the information is organized and the specific details.

At the end of the process, a team should review their understanding with each stakeholder. This review might take the form of a document that presents an organized record of their needs. It might include a presentation, though presentations are useful for conveying the big picture of the needs and are not good for presenting details.

33.9 Review milestone

The goal of initial purpose development is to have a clear but informal understanding of what the stakeholders want, what constraints they place on the system, and agreement from all parties that the documented understanding is correct.

There is a review at the end of purpose development. This review is to check that these conditions have been met, both for the team and for stakeholders. At this review, the project decides whether the team should continue on to concept development.

While the team should have been validating their understanding with each stakeholder as they go, a final check is in order before moving on to the greater investment of the concept development step.

At the end of each significant project phase, the team also reviews internally whether the project should continue or not. At the end of purpose development, this decision is based on four things:

Not all gaps and inconsistencies in stakeholder needs will necessarily be resolved by the end of the main purpose development step. That can be okay; the customer or other stakeholder may not know yet, and the team will have to work with them during concept development and negotiation. The decision at this review is based on the likelihood that the team can work with the stakeholder to resolve inconsistencies or gaps during concept development, before the project needs to begin specification and design.

Sometimes, however, an inconsistency is a sign that a stakeholder does not actually understand their own needs well enough and is asking for something impossible. This can be a sign that the team should not continue with the project.

The last condition can occur when the customer will be required by regulation or ethics to have some capability, but the customer is not aware of the need or not willing to gain the capability. For example, a customer wanting to fly an aircraft commercially may need to obtain an operating certificate as an air carrier, and establish capabilities like a safety management program. As another example, a customer might want a system to process radiological agents but is unwilling to discuss security measures to protect the material.

If the project determines not to proceed, then the team can wrap up what it has learned, archive the records generated, and inform stakeholders.

33.10 Change of purpose

A project’s purpose will change as time goes by. Indeed, change will be continuous over the life of a project and can happen at any time. No matter how much effort the project team puts into learning stakeholder needs, there will be needs that aren’t understood until later. Sometimes a need is not discovered until a system has been developed and deployed, and users finally realize something about what they do. Stakeholder needs change over time: as their competitive landscape changes, as regulation changes, as they discover new opportunities. For visionary projects, the potential customer market segments change as people discover new potential customers or find that some segment isn’t as promising as thought.

Changes that are discovered while developing the system purpose can be incorporated right away. Changes that occur after the purpose has been reviewed and baselined require more care, because system development will have proceeded based on the old needs and some parts of the system’s specification, design, or implementation may need to be changed in response.

The process for dealing with changes after the purpose has been baselined is roughly (following Section 29.7):

Note that changing the purpose leads to changing the artifacts that follow from it; see Section 34.5.7 for additional notes.

33.11 Using purpose

The system purpose encodes what customers and other stakeholders need or want. The system that is eventually defined, built, and deployed is worthwhile if it meets those needs; if it doesn’t meet the needs, the effort spent to make the system is wasted. In other words, the purpose guides all the rest of the work on the system.

While a first version of the purpose is developed at the beginning of a project, it will change as time goes by. Making the purpose as complete as possible is an exercise in risk management. Changes to purpose lead to changes in all the artifacts that derive from it—concept, specification, design, and so on. These changes to the system mean that some of the effort invested in developing to the original purpose will be lost, and more effort will be required to make the changes. The amount of effort lost may be small if the change is small, but a change in purpose can potentially lead to large changes in the system design and thus a lot of extra effort. A project can reduce the amount of extra effort by putting effort into working out the purpose first.

The purpose is recorded in a number of documents. These documents are kept under configuration management so that it is clear to anyone who needs to use purpose information that they are using the correct version, not something out of date.

The team uses the purpose directly to guide the concept they work out, and they judge the concepts by how well they satisfy the purpose. The system specifications start with the purpose and the concept, and then elaborate them into details about the system.

The team uses concept and specifications as they proceed into design and implementation. In this way they are using the purpose indirectly.

The team verifies each step of development, from purpose and concept to specification, from specification to design, and from design to implementation, to ensure that the system as finally implemented meets its purpose. However, this is like a game of telephone where the chain of interpretations can lead to differences between the original purpose and the final implementation.

To counter these accumulated errors, projects validate the final system implementation directly against the original purpose. The customer or other stakeholders can participate in the validation activities. This is part of the final system acceptance (Section 28.9).

When there is a request or need to change the purpose, the team looks at the changes to determine whether to agree to make the change as discussed above. If so, the team updates the old purpose with the changes, produces a new version of the purpose documents, and propagates the changes into the artifacts like concept and specifications that derive from the purpose.

The purpose also plays a role in contracting. If the project is developing the system for a specific customer or funder, they will develop a contract that determines what the project will build. That contract usually includes much of the system purpose as the definition of what the project is to deliver. Some of the purpose may be first documented in a request for proposals. The team typically works out purpose and concept as part of developing a proposal for the contract (Section 25.1). The purpose and parts of the concept are often then embedded in the contract, possibly by reference.

The team should bear in mind what they do not yet know once they have worked out the purpose. Purpose development is, obviously, a very early stage of a project. They do not know how they will approach designing and building the system; that comes with working out the concept. They do not know how much time, effort, or other resources will be required; that comes only after specification and some design have been worked out. They also do not yet know what they don’t know; there will be surprises in what stakeholders need as the project moves forward. The team must resist providing estimates of what will be required to make the system based only on the purpose [McConnell09, Chap. 1].

Purpose development happens before milestones in which the project decides whether to continue onward to build a system. It happens before a contract for the system has been signed. There is, therefore, a reasonable chance that the project will end after the effort to work out purpose (and concept) has been spent. In many organizations this means that the work of developing purpose and concept is funded internally (using “business development” funds). The process to get approval to use these internal resources depends on making the case that the project has a reasonable chance of getting funded; see Chapter 26 for a discussion about the case for a project.

Chapter 34: System concept

34.1 Introduction

At the very beginning of a system development project, there is generally at most a rough idea of what the system should be. The understanding of the system objectives is too vague to launch into development or writing tests right at the start.

The concept comes after working out the purpose for the system: who the stakeholders are that have an interest in the system, and what they need.

The system concept is the high-level summary of the system that the team will build in order to satisfy those needs. It defines—in a coarse, high-level way—the structure and behaviors of the system, who will use it and how, and how the team will approach building that system. It defines the scope of the system, that is, where the boundary lies between what is in the system and what is outside it.

The exercise of putting the concept together involves exploring different ways that the system could be organized. It is the time to investigate multiple high-level designs to see which ones best satisfy stakeholder needs, and to learn about the problems to be solved building the system. It is also the time to build up an understanding of major risks that could arise.

The concept that is selected at the end of concept development is one of the most important, near-irrevocable decisions that the project will make. Once the concept has been decided upon, the rest of the project’s work is to realize that decision. This means that the choices made in the concept must be well-informed and made carefully. At the same time, the concept does not have to be perfect; it will be revised as the project moves forward and the team finds the inevitable problems with the initial concept. Instead, the initial concept needs to be correct enough to bring the risk of later surprises down to an acceptable level.

Concept development formally ends when a concept review approves the concept documents. At this point the documents are baselined, meaning they are now stable and can be used to develop the system. Any further changes to the concept will require additional review and a new baseline.

After the concept development phase, the record of the concept remains important. Because it is the definition of the high-level system structure, it informs how the system is decomposed into components (Section 11.3). It defines high-level structure and interactions between those components (Chapter 12). The concept and the system purpose are the basis for system specifications (Chapter 36). All this information that follows from the concept should be traceable back to the concept.

Later in the project, the concept provides a guide to new team members who need to learn about the system before they start working on details. It serves management as a definition of the project’s goal, allowing the team to check whether they are designing and building what they intended to build.

34.2 Why build a concept?

First, the concept provides guidance for all the project’s later work. All the specifications, designs, and implementations will be about building the system defined in the concept. The concept helps unify all that effort toward building one consistent system. In the absence of a well-structured concept, the team will be taking undue risk designing and implementing parts of the system.

The finished system concept provides education for people joining later. It is a high-level summary of how the system is organized and functions, and new team members can use it to learn how the parts of the system fit together and how the system will be used.

The work of developing the system concept is an exercise in learning about the problem. What past work has been done that this project can reuse? What different ways are there to satisfy stakeholder needs? What major unsolved problems might lie ahead?

Building the concept is an opportunity to check feasibility. It is the time to see if everything that the customer wants is in fact possible—and if not, this makes that apparent and can be the trigger for negotiating something feasible with the customer.

Finally, the system concept provides an opportunity to validate that the team properly understands stakeholder needs. Building up the system concept requires examining stakeholder needs in the purpose in detail, and doing so can reveal inconsistencies and gaps in what the stakeholders have asked for. This becomes an opportunity to work with the stakeholders to clarify the system purpose. When the concept is complete, the team can share it with customers and other stakeholders to see if it meets their expectations.

The concept also supports the relationship between the project and customer by defining what the team will deliver. For projects working under contract, the concept informs the deliverables that both parties agree to. When a stakeholder has agreed to a concept, there is a clear basis for determining whether requests later on represent a change to the agreed-upon concept or not. This can lead to fewer difficult disagreements with a stakeholder.

A project that does not put time into developing a system concept runs several risks. There is a good chance that the parts of a system that they develop will not fit together into a coherent system because there is no overarching goal that the parts fit into. The project is likely to miss important alternate design approaches because they move too quickly toward some one design. There is also a good chance that whatever system they build will not meet stakeholder needs because they do not validate their approach early, leading either to expensive rework or a system that fails to meet needs.

One project I supported jumped into developing a few core technology components of the system they were building without taking time to think through the big picture for the work. The result was that when they began a couple years later to articulate what the whole product system might be, they had significant gaps in what they had built; they would need to re-engineer some of the major system components. Perhaps more important, they had developed institutional ways of thinking that made it hard for them to see where they needed to make changes for the whole system to work well.

In another project, a set of regulatory agencies developed a body of regulations and supplementary material about a kind of system that should be built. The result of the exercise was only an EU Implementing Regulation and a document expanding on those regulations. The regulation and commentary provided many details about the components in the system, but they did not record the big picture of the system: what behaviors as a whole the system was to provide, and how the components should support those behaviors. Instead each organization interpreted the regulations as best they could, and we found that different people had widely divergent understandings of what the components should do or the behaviors of the system as a whole. We spent a lot of time trying to reverse-engineer the concept from the hints in the details, and in the end it is unknown whether the concept we developed matched the regulators’ intent.

34.3 What a concept is and is not

I discussed what the concept includes in Section 28.4. The concept is a high-level view of what the system will do and how it will do it. It presents both the external and internal views of the system. The external view documents its behaviors and interfaces as seen by users outside the system. The internal view sketches the major parts and their behaviors that achieve the externally-visible behaviors. The scope marks the boundary between the two: what lies inside the system and what lies outside it.

This is a reductive way of looking at the concept, focusing on a breakdown by parts. This can lead to definitions of components or interfaces that don’t quite work together to meet the system’s objectives.

The concept should include a second, operational perspective on the system, focused on use cases or scenarios. This perspective shows how different externally-visible actions flow from a system user, through an interface, through components, and back through interfaces to users.

One can use the two perspectives to check the accuracy and completeness of both. When tracing activities through the operational perspective, each step of the operation should have a corresponding behavior in the structural perspective. Any operation that crosses the system boundary should be supported by an interface at the boundary. All the external behaviors in the structural view should have a corresponding flow of operations, and every operational flow should reflect some listed external behavior.
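The cross-check between the two perspectives can be made mechanical once both are written down in a structured form. The following is a minimal sketch in Python, not a prescribed tool: the class names (StructuralView, Step, Flow), the cross_check function, and the sample data are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class StructuralView:
    # Behaviors each component provides.
    component_behaviors: dict
    # Behaviors visible from outside the system.
    external_behaviors: set
    # Interfaces defined at the system boundary.
    boundary_interfaces: set


@dataclass
class Step:
    component: str
    behavior: str
    # Boundary interface used, if this step crosses the system boundary.
    interface: Optional[str] = None


@dataclass
class Flow:
    name: str
    external_behavior: str
    steps: list = field(default_factory=list)


def cross_check(structure, flows):
    """Return a list of mismatches between the structural and operational views."""
    problems = []
    for flow in flows:
        if flow.external_behavior not in structure.external_behaviors:
            problems.append(f"flow '{flow.name}' reflects no listed external behavior")
        for step in flow.steps:
            known = structure.component_behaviors.get(step.component, set())
            if step.behavior not in known:
                problems.append(
                    f"step '{step.behavior}' in flow '{flow.name}' has no "
                    f"behavior in component '{step.component}'"
                )
            if step.interface is not None and step.interface not in structure.boundary_interfaces:
                problems.append(
                    f"step '{step.behavior}' crosses the boundary through "
                    f"undefined interface '{step.interface}'"
                )
    covered = {flow.external_behavior for flow in flows}
    for behavior in sorted(structure.external_behaviors - covered):
        problems.append(f"external behavior '{behavior}' has no operational flow")
    return problems


# Hypothetical sample data, loosely based on a commanded measurement.
structure = StructuralView(
    component_behaviors={
        "ground station": {"send command"},
        "spacecraft": {"execute command", "record measurement"},
    },
    external_behaviors={"take measurement", "report health"},
    boundary_interfaces={"operator console"},
)
flows = [
    Flow(
        name="commanded measurement",
        external_behavior="take measurement",
        steps=[
            Step("ground station", "send command", interface="operator console"),
            Step("spacecraft", "execute command"),
            Step("spacecraft", "downlink results"),
        ],
    )
]
problems = cross_check(structure, flows)
```

With this sample data the check reports two gaps: the flow uses a spacecraft behavior ("downlink results") that the structural view does not list, and the external behavior "report health" has no operational flow. Each finding points at a place where one of the two perspectives needs more work.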

The information about the structure and operations in the concept is the outcome of decisions made by the team. The concept should therefore be backed up by rationales that describe why the team made the decisions recorded in the concept, and the arguments for why this particular system concept will meet the system’s purpose (that is, stakeholder needs).

Many projects will also include information about how the system’s development can proceed: some ideas about the order different parts should be developed and tested, the organizations and resources that will be part of the project, and some major milestones that might measure progress.

At the same time, the concept is not the complete specification of the system. It is just a sketch. Work on the concept should stop when it appears likely to cover the most important use cases and stakeholder needs, is likely feasible, and appears to be self-consistent. It should have explored the system enough to reveal unexpected constraints or design risks, without necessarily solving all of them. It should not be very detailed; part of its purpose is to prepare for working out the details.

I have seen many projects try to turn the concept into the full system design. In doing so, they have bypassed important review steps and have produced an unwieldy document that people can’t follow. They have also produced a system design that has many hidden gaps or errors, and they have skipped the design and evaluation steps that would have found those.

As a project progresses, the concept’s role changes from being the initial possible design to being a high-level guide to the system. It becomes a way for new people to learn about the overall structure of the system. In this role it needs to summarize, rather than go into detail. It needs to be an accurate guide, which means that the team must update the concept as design and implementation proceeds. Later design steps are likely to find problems with the concept as initially developed, or find better ways to do something.

34.3.1 Concept versus CONOPS

I use the general term “concept” in this work as opposed to the term “concept of operations”, usually abbreviated CONOPS. In many organizations, these terms are used informally and interchangeably.

Formal definitions of each can differ. For example, the NASA Systems Engineering Handbook [NASA16, Section 2.3] talks about a concept of operations as being a scenario of how a mission system can operate, and treats it as one part of the overall concept. The NASA model separates these operational scenarios from the work breakdown and mission architecture.

By comparison, I use “concept” in the broader sense of including system structure, users, usage scenarios, and interfaces.

34.4 Developing the concept

I will use a simplified version of the project I discussed in Section 4.1 to illustrate techniques. This was a NASA technology demonstration mission that was intended to show how multiple small spacecraft could work together to perform science missions. The early mission analysis identified two key technologies: the ability to operate the collection of spacecraft without having to communicate directly with each one from ground stations and the ability to communicate between spacecraft (“cross-linking”) to share commands and data throughout the collection. As with many NASA missions, the initial parts of the concept were worked out by an office dedicated to developing new mission ideas, with the initial concept handed to the project team to finish out.

34.4.1 Incremental development

Some projects can work out the initial concept quickly from start to finish. Other projects will require time and multiple iterations to develop the concept.

The amount of effort required to develop the concept depends in part on the project’s scope and on how much has already been determined. When a project is responding to a customer’s request for a proposal, there are often many constraints that narrow down the possible system concepts. When a project is evolving an existing system, there is often only a narrow range of acceptable solutions. On the other hand, when a project is creating a new kind of system there are fewer answers already determined.

Working out the system concept is a process. The solution does not usually arrive in one flash of insight. The team should expect to work out the concept in pieces, and to backtrack over parts that once seemed good and settled.

This is an experimental process. It involves building up a set of hypotheses about what the system should be and then testing them. In many projects, the team will develop multiple partial concepts and discard them as they are evaluated. The team might develop prototypes or proofs of concept for key items to see if they are feasible. The team might split into groups, each working on a different, competing approach. This work of evaluating the possibilities is an essential part of the process.

Concept development can end when the concept is likely feasible, likely meets stakeholder needs, and is agreed upon within the team.

The concept is likely feasible when it is unlikely there are hidden gaps or technical surprises, and the system parts can likely be built (or, better, there are solution approaches for each part). High-risk aspects of the concept have been identified and investigated, and there are likely solutions for them.

The concept likely meets stakeholder needs when the team can show that all of the most important needs have been addressed satisfactorily. The concept might not yet address all the minor needs, but it is likely that the concept will do so.

The concept provides the common vision for the work the team will do as it designs and implements the system. Everyone on the team should be in agreement on the concept, with no misunderstandings. The team will need to put some effort into ensuring that everyone involved shares this common understanding, and possibly correct ambiguous parts of the concept documentation. It is important that no one on the team has a separate agenda that does not match the system concept. I worked on one project where one of the sub-teams was focused on using the project to develop a radio component that they had been wanting to build, even though that component did not meet the actual system needs.

The team makes tradeoffs when deciding that the system concept is done enough to move on to specification and design: completeness versus making progress, risk of making poor decisions versus just getting on with it, risk of stakeholder acceptance or not. There is no recipe for deciding how to make these tradeoffs.

The concept does not have to be complete and finished before part of the team begins specifying and designing some system components. Some people can begin working on parts of the system that are clearly needed without waiting for every bit of the concept to be worked out.

Proceeding to specify and design some parts of the system leads to three risks. First, there is the risk that further concept work will reveal an unexpected problem with the part of the concept people are working on, leading to lost effort when that part has to be redone. Second, proceeding on part of the system can create momentum to continue with a poor concept or design choice just because resources have already been invested in it. Finally, starting on specification and design can lead to a perception that the concept is finished even though it is not. This can lead to people being redirected away from further necessary concept work, leaving problems undetected until much later in development.

At the same time, starting some specification and design work can be an appropriate choice when the risks can be identified and contained. If one part of the system concept appears certain while other parts are not yet worked out, and there is reason to believe that those other parts will not invalidate the worked-out part, then part of the team can move forward into specification and design without undue risk—though there will still be some risk of further concept development finding a surprise.

One possible outcome of concept development is that some aspects of the system are just not ready to be developed. The stakeholder needs might not be well enough understood to make a choice among alternative concepts, or some part of the system might need more innovation than the project can afford at the time. Sometimes the best approach is to decide to build only a part of the system right now, and plan for future evolution as technology or market information develops. See Section 34.10.3 for more discussion on this.

Sometimes the result of the concept development effort is finding that the project should not proceed. In that case, the reasons for not proceeding should be documented and reviewed and the decision to end the project formally made, so that the team can move on to something else.

34.4.2 Determining scope

In most cases, much of the system’s scope can be inferred from the system purpose. The purpose usually defines who will use the system (who are thus outside it) and the major functions the system is to perform (and thus inside the system).

There are usually some grey areas, however. Customers may quietly assume that someone will provide some capability and not talk about it. Maintenance or operator activities, for example, are often omitted; are those who do these tasks part of the system, with roles to be designed, or are they like users that sit outside the system?

In the example project, all the explicit stakeholder needs were focused on performing a science mission, including a definition of the kind of sensor to be flown and the measurements to be taken. There was also a focus on communication to and within the collection of spacecraft. The needs included a rough maximum budget and timeline.

The project purpose did not, however, address how the science operations would work. It was left unstated how science activities would work, how investigators might be involved, or what would happen with sensor data when received. This meant that it was unclear whether these functions should be included within the system or not.

34.4.3 Determining environment

The environment consists of everything outside the scope of the system that the system interacts with in one way or another.

The whole environment includes different kinds of things: the physical environment in which the system resides; the information environment, meaning how information is supplied to the system or used; and the user environment of people who interact with the system.

The physical environment includes the place where pieces of the system reside. This can include atmosphere, dust, heat, humidity, impacts, vibration, radiation, and electromagnetic waves. It also includes what the pieces of the system are attached to or where they are placed: rooms, racks, brackets. It can include aspects like the accessibility of a system component for security, such as how a room is protected.

The energy environment is related to the physical. This includes how energy enters and leaves the system: electrical energy, hydraulic or pneumatic energy, mechanical forces, or heat.

The information environment includes the information that is sent to the system or received from the system. This might be other systems to which this one is connected through networks. It might be information that is provided to other systems through things like signal lights or physical gates.

The user environment records who will use the system, and what they will expect it to do. This typically lists who the people are (or rather, their roles) and what they do with the system. The user environment also includes what kind of interface the people will have, whether a typical computer system or lights, sounds, and levers. This can be the place to record the skill level of the users and the task environment. A quiet office with a laptop is quite different from the flight deck of an aircraft.

Systems may well have more kinds of environment. In general, if something is outside of the system’s scope and it can affect how the system functions, it should be considered part of the environment. For example: government system operations can be affected by the organization that funds them, and commercial systems can be affected by the training and oversight budget that the company using the system provides.

Working out information about the environment is related to working out the interfaces between things in the environment and the system.

34.4.4 Developing system structure

Working out the broad structure of a new system is an exercise in creative problem-solving. Many, many people have published and presented material on solving problems; I do not mean to reproduce or summarize that work here. Polya’s work [Polya57] is a classic on this, and I recommend it as a foundation for problem-solving in general. This chapter adds some techniques specific to working out a system concept onto that foundation.

I have taken a reductive approach to defining and understanding systems in this work. This means that a system can be understood by breaking one thing up into smaller parts, and reasoning about how those parts combine to produce emergent properties. (As noted in Chapter 6, there are non-reductive ways to design and understand systems; however, all the ones I know of are suited more for machine development and not for human understanding.)

This approach means that developing a system concept amounts to working out how to break the whole system into a few high-level parts, along with how those parts interact.

A potential parts structure is appropriate if it meets stakeholder needs. Tracing out how actions and information flow through the system in order to effect externally-visible behaviors is one of the essential checks for a correct or complete structure.

There are several problem-solving techniques that can help guide one toward developing a system concept. Most of these are common to many different kinds of problems; a few are specific to developing system structure.

In all these techniques, bear in mind that developing the concept is not a smooth progression toward a clear end goal. The work is more like searching through a maze, where some approaches will appear to be promising for a while until someone finds a flaw. There will be many potential approaches to different parts of the concept, and often none of them are a perfect fit to the need.

Be clear about the problem. The first step to developing system structure is to understand the problem at hand. Understanding the system purpose is a big part of understanding the problem; that is why understanding purpose comes before working out concept.

The next step is to really understand what the system should do and be: not just knowing what’s in the purpose, but understanding the behaviors or functions it should provide.

I find that this usually means putting together a list of clear usage scenarios. A usage scenario can be a statement like “user X does action Y; the system responds with result Z”. This list doesn’t have to be exhaustive, but it should cover at least the most important behaviors in the system purpose. It also should not be too detailed at this stage of the work. The purpose is understanding, not complete specification.

I then take each one of the scenarios and try to understand what it means. I try to imagine what each one might involve inside the system by walking through the flows of information or control to see what different components might need to do or what specific functions might be needed. This exercise often reveals gaps where there is a problem to work out. In the small spacecraft example, one scenario was sending a command for all the spacecraft to take some sensor measurement at the same time, having them take those measurements, and getting the results to someone on the ground who would use them. That scenario has implications: a command sent from the ground can get to all spacecraft; spacecraft know what time it is accurately enough to take the measurements; each spacecraft can record its measurements; and there is a way to collect and transmit those measurements to the ground. Stepping through the activities involved in this scenario helped me at the time to learn much more about how the system had to work, and thus how it should be structured.

As people work out potential solutions for how the system might be split into major components, these scenarios are helpful in evaluating the solution. One should be able to take the scenarios and show how they will cause activity to flow through the components. In the example, I worked out a potential structure for spacecraft software functions involving a communication component, a routing component, a message storage component, and a command execution component. I could look at the scenario in the previous paragraph and trace out how each of these components would receive input, take action, and send results to other components. Being able to trace that flow of activity gave me confidence that the component structure was at least feasible.
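Tracing a scenario through a candidate structure can be done on paper, but a few lines of code make the same walk repeatable. The sketch below is a minimal illustration in Python and rests on assumptions: the four component names come from the example, while the handoff rules between them, the message fields, and the spacecraft identifier "sc1" are invented for the illustration.

```python
THIS_SPACECRAFT = "sc1"  # hypothetical identifier for the spacecraft being traced


def communication(message):
    # Assumed behavior: accept an incoming message and pass it to routing.
    return "routing", message


def routing(message):
    # Assumed behavior: every message is kept so it can be cross-linked onward.
    return "message storage", message


def message_storage(message):
    # Assumed behavior: after storing, hand over only messages addressed here.
    if message["to"] in ("all", THIS_SPACECRAFT):
        return "command execution", message
    return None, message


def command_execution(message):
    # Assumed behavior: run the command; the flow ends here.
    return None, message


COMPONENTS = {
    "communication": communication,
    "routing": routing,
    "message storage": message_storage,
    "command execution": command_execution,
}


def trace(message, start="communication", components=COMPONENTS, max_steps=20):
    """Return the ordered list of components a message passes through."""
    path = []
    current = start
    while current is not None:
        if current not in components:
            raise KeyError(f"no component named {current!r}: gap in the structure")
        if len(path) >= max_steps:
            raise RuntimeError("flow does not terminate: possible loop")
        path.append(current)
        current, message = components[current](message)
    return path


command = {"to": "all", "body": "take measurement at time T"}
relayed = {"to": "sc2", "body": "take measurement at time T"}
path_for_command = trace(command)
path_for_relayed = trace(relayed)
```

A command addressed to all spacecraft passes through all four components, while a message addressed only to another spacecraft stops at message storage, where it waits to be forwarded. A scenario that names a component missing from the table raises an error, which is the code equivalent of finding a gap in the structure.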

Besides looking at specific scenarios, systems often have modes of operation where the expected behavior is different. This can help to group usage scenarios and to suggest whether the list of scenarios is complete enough.

For the small spacecraft project example, the system would have a few operational modes. These included:

Pre-launch, when spacecraft are given final checks, final parameter settings, and installed into the dispenser on the launch vehicle.
Launch and deployment, when the spacecraft must remain inert.
Startup, after the spacecraft has been deployed from the dispenser and turns on for the first time on orbit.
Monitoring, when ground systems monitor the spacecraft for correct operation and fix problems when they occur.
Testing, when ground systems send commands up to one spacecraft and check that the commands are distributed to all the others, and the reverse path to collect results.
Science operation, when ground systems instruct the spacecraft to perform data measurement using the communication framework that has been tested.
Off-nominal operation, when one or more spacecraft are not functioning as expected and should be fixed.
Deactivation and disposal, when the mission is ending and the spacecraft are made safe to remain in orbit until their orbits decay and they burn up on reentry.
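Grouping scenarios by mode also gives a quick completeness check: any mode with no scenarios is a prompt to go looking for missing behaviors. The following is a minimal sketch in Python. The mode names follow the list above; the Scenario record uses the “user X does action Y; the system responds with result Z” form, and the two sample scenarios are paraphrased from the example mission for illustration only.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    PRE_LAUNCH = "pre-launch"
    LAUNCH_AND_DEPLOYMENT = "launch and deployment"
    STARTUP = "startup"
    MONITORING = "monitoring"
    TESTING = "testing"
    SCIENCE_OPERATION = "science operation"
    OFF_NOMINAL = "off-nominal operation"
    DEACTIVATION_AND_DISPOSAL = "deactivation and disposal"


@dataclass(frozen=True)
class Scenario:
    mode: Mode
    actor: str     # user X
    action: str    # does action Y
    response: str  # the system responds with result Z


def uncovered_modes(scenarios):
    """Return the modes, in definition order, that have no scenario yet."""
    covered = {scenario.mode for scenario in scenarios}
    return [mode for mode in Mode if mode not in covered]


scenarios = [
    Scenario(
        Mode.SCIENCE_OPERATION,
        actor="ground operator",
        action="commands all spacecraft to take a measurement at the same time",
        response="measurements are taken and the results returned to the ground",
    ),
    Scenario(
        Mode.TESTING,
        actor="ground operator",
        action="sends a command to one spacecraft",
        response="the command is distributed to all the other spacecraft",
    ),
]
missing = uncovered_modes(scenarios)
```

Here six of the eight modes still have no scenario, which suggests the scenario list is far from complete. A mode appearing in this list does not by itself mean something is wrong; it is a question to answer, not a failure.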

Working through the scenarios and functions often reveals behaviors that were not stated directly in the system purpose. In the example system, not all of the operational modes were apparent at the start. The need for an explicit deactivation and disposal mode was caused by US requirements for ensuring that a space flight mission does not create orbital debris that would pose a hazard to other spacecraft. Many behaviors needed during pre-launch, launch, and startup come from requirements on launch vehicle safety.

Break the concept into subproblems. The next step is to begin to divide the system into parts. The goal is to work out all of the high-level components in the system. This is where creativity is required in working out a system concept.

Each part should deal with a separate concern. It should deal with one piece of system function; it should not try to mix dissimilar functions together. Section 11.2 described criteria for dividing parts of a system into components: singularity of purpose; weak coupling with other things compared to within the component; and ability to replace one design of the component with another.

Many times one approaches this goal piecewise, in small steps, because the overall problem is too big. Polya calls this “Hunting for the Helpful Idea” [Polya57, p. 34]. The idea is to explore the problem to find some ways to begin tackling it. This can be by finding a similar problem that has already been solved and applying it to the problem at hand. It can be by drawing out part of the problem and looking for information that will guide toward a solution. One can tackle just one part of the system, ignoring other parts for the moment, in order to find a partial solution that might be combined with other partial solutions. Sometimes looking at things that constrain a solution can help focus attention on things that will help.

Finding similar problems. In some cases, there are common structural patterns that provide a guide for how to decompose a system. Space flight missions, for example, almost always decompose into a launch segment, space segment, and ground segment; the ground segment in turn decomposes into communications, operations, and mission or science operations. Most fixed-wing aircraft are organized around common patterns: body, wings, control surfaces, power plants, landing gear, and so on.

In other cases, there might be a behavioral pattern that applies. When dealing with communication protocols, there is a body of knowledge about how to avoid common performance problems (rate control, avoiding head-of-line blocking) or how to achieve security properties (authentication exchanges, nonces, cryptography).

Using common patterns is a benefit when the patterns match the system purpose. Common patterns can, however, impose a bias when the patterns do not quite match what the system needs. For example, using the common patterns for fixed-wing aircraft structure is not a good fit when the best solution is a blended wing body structure, which does not have separate body and wing structures.

Sometimes there are common ways to approach a problem. There are analytical methods for designing system safety and security, which I will deal with in Chapter 46. These provide a specific set of steps to follow to work out specific system needs.

For the example mission, we could draw on the history of many cubesat-class spacecraft [Cubesat22] that have been flown. We reused the basic system structure, consisting of a frame, electrical power system, flight computer, attitude determination and control, communications, and so on. Communication patterns among the spacecraft were based on previous work in the early Internet, when nodes were not always connected.

Sketching out part of the problem. Diagrams are powerful tools for exploring system design. I explore many system ideas by drawing block diagrams. As I write this, I have been working on a concept for a possible new system. I talked with the stakeholder to understand the key functions they expected; this revealed who the major users would be and what the system might do for each one. I then started imagining a set of internal functions the system might have to perform these functions, and sketched the way data might flow among those internal functions. This has begun to reveal possible ways to break those internal functions into high-level components. Of course, this led me to only one possible way to decompose the system; I later tried sketching from a different point of view and saw different things. (I discuss the value of multiple viewpoints below.)

For the example NASA mission, the highest-level hardware components were part of the definition: a collection of small spacecraft, plus ground systems to talk with them and a launcher to get them all on orbit. Sketching how the spacecraft would move relative to each other (a problem in orbital dynamics) and how they would need to communicate with each other was useful for trying out different communication strategies.

Trying many approaches. I find that the first idea I have for a solution is often not the one I end up with. In some cases I have started sketching approaches from scratch multiple times, each time starting with a different system function and seeing where that leads.

In one project on UAS traffic management, my team needed algorithms that could generate a flight plan. The trajectory needed to avoid obstacles and stay away from other nearby traffic. We investigated perhaps a dozen algorithmic approaches to the problem; some were quite simple, others more complex. We looked at different path planning algorithms in the literature and evaluated how they would perform for our specific problem. In the process we learned more about the actual requirements for the path planning algorithm. The final choice was different from any that we knew about at the beginning.

This example is typical. We looked for different algorithmic approaches that people had developed. We evaluated some of them on paper; we prototyped others to see how they performed. We threw away several prototypes when we found use cases they would not handle, or when they did not have the needed performance.

Tackling part of the solution. For a system that doesn’t follow existing patterns as a whole, sometimes there is one part of the function that one can figure out. One can figure out a possible approach for one piece, then trace out system behaviors using that component. This can in turn reveal clues for how to tackle other parts of the system.

For example, in the example NASA system, I had an idea for how to address and forward messages between spacecraft. This involved identifying which spacecraft (one or multiple) need to get or send some information; it involved creating logical “channels” for senders and recipients and addressing messages to channels. I then started tracing what would happen when a command was generated on the ground, addressed to some channel. This led to thinking about how to schedule communication sessions between spacecraft, how to store and manage messages that are being shared, and so on.
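The idea of addressing messages to logical channels, not to individual spacecraft, can be sketched in a few lines. The following Python sketch is an illustration under assumptions, not the mission's actual design: the ChannelTable class, the handle function, the channel names, and the spacecraft identifiers are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Message:
    channel: str  # logical channel the message is addressed to
    payload: str


class ChannelTable:
    """Maps each logical channel to the spacecraft that belong to it."""

    def __init__(self):
        self._members = {}

    def subscribe(self, channel, spacecraft_id):
        self._members.setdefault(channel, set()).add(spacecraft_id)

    def recipients(self, channel):
        # An unknown channel simply has no recipients.
        return frozenset(self._members.get(channel, set()))


def handle(message, my_id, table):
    """Decide what one spacecraft does with a message.

    Returns (deliver_locally, forward_to): whether this spacecraft is a
    recipient, and which other recipients still need a copy.
    """
    members = table.recipients(message.channel)
    deliver_locally = my_id in members
    forward_to = sorted(members - {my_id})
    return deliver_locally, forward_to


table = ChannelTable()
for spacecraft in ("sc1", "sc2", "sc3"):
    table.subscribe("all", spacecraft)
table.subscribe("imagers", "sc2")

to_everyone = handle(Message("all", "take measurement"), "sc1", table)
to_imagers = handle(Message("imagers", "calibrate"), "sc1", table)
```

In this sketch spacecraft sc1 acts on the message sent to the "all" channel and also holds it for sc2 and sc3, while it only relays the message sent to "imagers". Working through even this small case raises the follow-on questions mentioned above, such as when forwarding happens and how long stored messages are kept.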

This was only part of the solution, of course. It addressed nothing about how to start up a spacecraft after deployment, or how to handle off-nominal behavior. But it provided a start, and slightly reduced the complexity of thinking about other communication behaviors.

Dependencies. Working out one system behavior can reveal other necessary behaviors that are implied. The first function depends on other functions happening; those other functions depend on yet more functions, and so on.

The example mission was expected to communicate between spacecraft and a ground station. The ground station would have one antenna to communicate with all the spacecraft. The spacecraft lacked the power to keep the transceiver turned on all the time. This implied that each spacecraft had to determine when the ground station was about to come into view in order to turn on its transceiver. Knowing this event implied knowing where the spacecraft and the ground station were. The ground station location could be configured before launch, but the spacecraft location was dynamic. It could be determined three ways: using a GNSS receiver, by modeling the spacecraft’s orbit and computing a location, or by keeping the spacecraft pointed toward the ground (nadir) and listening for a message from the ground station. Modeling the spacecraft orbit implied determining what the orbit is, either from observation (by GNSS, for example) or by being told by ground systems. Of course, being told by ground systems implies communicating with the ground, which creates a circular dependency. Keeping the spacecraft antenna pointed toward nadir and listening for a message implies determining which direction the ground is, pointing the spacecraft appropriately, and keeping the receiver powered up to listen for the signal.

In addition, the spacecraft was using a directional antenna and the radio signals were transmitted at low power. A directional antenna means that the antenna receives (and sends) signals more strongly in some directions than in others. This implies that the antenna needs to be oriented so that a direction where it sends and receives most effectively is pointed toward the ground station, and vice versa. Being able to point the antenna implies knowing the direction toward the ground station, bearing in mind that the direction will change as the spacecraft flies past it.

In other words, the design of the communication and electrical power systems created a need for spacecraft guidance and attitude control capabilities.
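A chain of implied behaviors like this one can be traced mechanically. The following is a minimal sketch, not the tooling used on the mission: the behavior names are paraphrased from the example, and the names `DEPENDS_ON` and `find_cycle` are hypothetical, introduced only for illustration. It records which behavior depends on which, and uses a depth-first search to expose a circular dependency such as the one created when the orbit is supplied by ground systems.

```python
# Hypothetical dependency map: each behavior lists the behaviors it needs.
# Names are paraphrased from the example mission. The option shown for
# "model orbit" is "told by ground systems", the one that loops back.
DEPENDS_ON = {
    "communicate with ground": ["turn on transceiver in time"],
    "turn on transceiver in time": ["predict ground station in view"],
    "predict ground station in view": ["know spacecraft location"],
    "know spacecraft location": ["model orbit"],
    "model orbit": ["receive orbit from ground"],
    "receive orbit from ground": ["communicate with ground"],
}


def find_cycle(graph):
    """Return one dependency cycle as a list of nodes, or None if acyclic."""
    visiting, done = set(), set()
    path = []

    def visit(node):
        if node in done:
            return None
        if node in visiting:
            # Back edge found: the cycle is the tail of the current path.
            return path[path.index(node):] + [node]
        visiting.add(node)
        path.append(node)
        for dep in graph.get(node, []):
            found = visit(dep)
            if found:
                return found
        path.pop()
        visiting.discard(node)
        done.add(node)
        return None

    for start in graph:
        found = visit(start)
        if found:
            return found
    return None


cycle = find_cycle(DEPENDS_ON)
print(" -> ".join(cycle) if cycle else "no circular dependency")

# Choosing a different way to obtain the orbit removes the loop.
with_gnss = dict(DEPENDS_ON)
with_gnss["model orbit"] = ["observe position with GNSS"]
print(find_cycle(with_gnss))
```

In this sketch, swapping the "told by ground systems" option for a GNSS observation is what breaks the cycle, which mirrors the choice among the three ways of determining location described above.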

Looking at constraints. The space of possible solutions for a novel system can be very large. Being able to focus attention on a smaller set of problems can help.

I worked for a while on the basic designs for UAV traffic management (UTM) systems. (I discuss this work as a case study in Section 63.7.1.) These systems provide traffic control for UAVs (drones), keeping them separated in flight from each other and from manned aircraft, among other functions.

Working out a flight path for a UAV and then ensuring that the planned path will stay far enough away from other aircraft are two of the key functions of such a system. One fundamental question was: what is the best structure for these planning and checking functions? This decision shapes how much of the rest of the UTM system is organized.

In that project, we expected that the UTM system would need to handle many flights occurring in shared airspace, that there would be multiple kinds of UAVs with different performance characteristics, and that there would be multiple organizations flying those UAVs. Those organizations would be expecting efficient and fair use of the airspace. Some organizations would be designing and building their own UAVs, and they would regularly be introducing new capabilities for their aircraft.

The need to support multiple kinds of UAVs turned out to be an important constraint.

The process of computing a possible flight plan depends on where the flight is supposed to go and the capabilities of the aircraft to fly, hover, or maneuver. One UAV model is also likely to have different abilities to communicate with ground services or its operator than others. This means that computing a flight path can require a detailed model of the UAV’s capabilities. In addition, different operators need different kinds of flight paths: one operator may want their UAV to fly from point A to point B, while another wants their UAV to fly a back-and-forth pattern covering a farm’s field.

There was an early design choice. Should the UTM system provide centralized flight path planning, or break up the planning into multiple systems? In the centralized approach, one common component would take in requests for a flight plan and compute a plan, in the process consulting databases of what flight plans were already approved and what aircraft were already in flight. This would have the advantage of assuring that all flight plans would be properly checked before being returned, and it could lead to computational efficiencies by coupling planning and databases. A centralized approach, however, would have a serious problem: it would need to know the performance model of every UAV and every kind of flight path any operator might want, which would be infeasible—especially if some operator wanted to innovate.

The constraint that the UTM system must support multiple kinds of aircraft and flight paths made centralized flight planning infeasible. We quickly focused attention on designs that made computing flight paths the responsibility of individual operators or of services they used, and investigated other ways to ensure that flight plans were checked properly. We did not spend any more time on centralized computation.
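The split between operator-specific planning and common checking can be illustrated with a small sketch. This is not the design of any actual UTM system; the `Waypoint` type, the two planner functions, the `conflicts` check, the discrete time steps, and the flat two-dimensional geometry are all simplifying assumptions made for illustration. The point it shows is that the shared check only needs the resulting plan, not a model of the aircraft or of how the plan was computed.

```python
from dataclasses import dataclass
from math import dist


@dataclass(frozen=True)
class Waypoint:
    t: int      # discrete time step (an assumption of this sketch)
    x: float    # metres east
    y: float    # metres north


def point_to_point(start, goal, steps):
    """One operator's planner: a straight line from start to goal."""
    return [
        Waypoint(
            t,
            start[0] + (goal[0] - start[0]) * t / steps,
            start[1] + (goal[1] - start[1]) * t / steps,
        )
        for t in range(steps + 1)
    ]


def back_and_forth(x0, x1, y_rows):
    """Another operator's planner: sweep rows across a field."""
    plan, t = [], 0
    for i, y in enumerate(y_rows):
        xs = (x0, x1) if i % 2 == 0 else (x1, x0)
        for x in xs:
            plan.append(Waypoint(t, x, y))
            t += 1
    return plan


def conflicts(plan, approved_plans, min_separation):
    """Shared check: True if the plan comes closer than min_separation to
    any approved plan at the same time step."""
    for other in approved_plans:
        by_time = {w.t: w for w in other}
        for w in plan:
            o = by_time.get(w.t)
            if o is not None and dist((w.x, w.y), (o.x, o.y)) < min_separation:
                return True
    return False


approved = [back_and_forth(0.0, 100.0, [0.0, 20.0])]
crossing = point_to_point((0.0, 0.0), (100.0, 0.0), 3)
far_away = point_to_point((0.0, 500.0), (100.0, 500.0), 3)
print(conflicts(crossing, approved, 10.0))
print(conflicts(far_away, approved, 10.0))
```

A new operator could add a third planner without any change to `conflicts`, which is the property that the multiple-aircraft constraint demanded.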

Evaluation. A concept is only good if it is fit for purpose. That means satisfying the stakeholder needs recorded in the system purpose. Each concept developed is checked against those needs to see how well it does, and alternative concepts can be compared based on how well they satisfy the purpose.

There are several kinds of evaluations to be done.

Checking that the system provides the functions needed by stakeholders.
Evaluating the likely performance or physical characteristics of the system: mass, resilience, expected environments, or operable lifetime. Aircraft concepts might address an estimated speed, range, and payload. Spacecraft concepts might address likely attitude control, delta-v capability, or lifetime.
Gathering evidence that the system can be safe, secure, reliable, or have similar high-level properties.
Determining roughly how much technical risk or invention will be needed, or alternatively how much can be re-used or acquired off the shelf.
Making initial rough estimates of the time, cost, and effort needed to develop the system.

Often, a back-of-the-envelope check is enough evaluation. Sometimes a simple model or prototype is needed to check out part of the concept.

Almost none of these aspects can be evaluated with much certainty at the concept level. Most will require further detail that will come later with decomposition and design. The standard for these evaluations at concept time is merely whether it is plausible that the needs can be met.

When some parts of a system concept can be matched to a comparable existing system, it is generally possible to make quantitative estimates of cost, behavior, and other properties with moderate accuracy. The advice from McConnell on estimation applies here: be honest about how much is and is not certain, and refrain from making precise commitments when the information does not justify it [McConnell09].

Evaluating the concept should reveal where there is technical uncertainty or programmatic risk. When one high-level system component will require innovation, that represents uncertainty. This information should be gathered and recorded along with the concept itself.

Concept development is an appropriate time to begin to evaluate safety, security, resilience, and similar properties. Design for these properties starts as the concept is developed [Leveson11, Chapter 9]. The system purpose should include the foundation for safety or security by identifying what should be protected, which enables identifying hazards in the concept. Once the hazards are identified, one can evaluate the concept to see how they can be eliminated or mitigated at the high level. I discuss the process further in Chapter 46.

The process of evaluating a concept leads to discovering the strengths and weaknesses of that concept, especially a partial concept. Finding weaknesses can suggest ways to improve the concept.

The concept evaluation should also produce evidence that the system purpose (stakeholder needs) has been met, in the form of traces from the items in the purpose to the parts of the concept that show that those items are addressed. This might be documented as a compliance matrix or table. The traces may need to be accompanied by some kind of argument for why or how the need is addressed. This kind of tracing is important when a reviewer or stakeholder checks the concept, and it is essential when the concept is part of a response to a request for proposals.

Choosing between alternatives. There will usually be multiple ways to structure some part of the concept, and at some point one will have to make a choice between these alternatives.

During concept development, these decisions have a great effect on the later course of the project. Once people have been set on the course to specify, design, and build a system in a particular way, their effort has been spent; changing course to a different basic system structure means discarding that effort and paying the cost of changing the team’s direction (see Section 8.1.5 for a discussion on how teams have inertia).

In other words, these choices should be made carefully. Each alternative should be worked out to a similar degree of detail, and evaluated against the same criteria, as discussed in the evaluation section above. In some cases, asking for customer feedback on some of the promising alternatives can provide useful input. The project eventually needs to commit to one concept, and then stick with that decision—no factions trying to resurrect a different approach later.

At the same time, good decisions do not necessarily require high precision or detail in evaluation. The evaluation only needs to be good enough or thorough enough to be confident in the comparison between alternatives. If alternative A has ~10x the value of alternative B, one does not need more than one digit of precision to make a correct choice.
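The arithmetic behind this point can be sketched in a few lines. The functions `value_range` and `choice_is_robust` are hypothetical names, and the model of uncertainty (each estimate known only to within a multiplicative factor) is an assumption chosen for illustration rather than a recommended method. The sketch shows that a 10x difference survives a factor-of-two uncertainty in both estimates, while a 1.5x difference does not.

```python
def value_range(estimate, uncertainty_factor):
    """Range implied by an estimate known only to within a multiplicative factor."""
    return estimate / uncertainty_factor, estimate * uncertainty_factor


def choice_is_robust(estimate_a, estimate_b, uncertainty_factor):
    """True if the two ranges do not overlap, so the ordering cannot flip."""
    low_a, high_a = value_range(estimate_a, uncertainty_factor)
    low_b, high_b = value_range(estimate_b, uncertainty_factor)
    return low_a > high_b or low_b > high_a


# A is about 10x B: ranges are 5..20 and 0.5..2, so they do not overlap.
print(choice_is_robust(10.0, 1.0, 2.0))

# A is about 1.5x B: ranges are 0.75..3 and 0.5..2, so they overlap and
# a more careful evaluation would be needed before choosing.
print(choice_is_robust(1.5, 1.0, 2.0))
```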

Biases easily creep into this kind of decision. It is important that the team evaluates each alternative on the same criteria. There is a particular risk in comparing alternatives based on a sum of weighted scores—so much for property A, so much for property B, and so on, with a weighting factor for each property. The scoring and weighting is fundamentally subjective in most cases, and it is possible to bias the results either by selecting arbitrary weights that predetermine the outcome or by inflating some scores for some approaches.
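A small example makes the sensitivity to weights concrete. The scores, weights, and property names below are invented for illustration and do not come from any real project; `weighted_score` and `ranking` are hypothetical helper names. The sketch shows two equally defensible weightings reversing the ranking of two alternatives whose scores do not change.

```python
def weighted_score(scores, weights):
    """Sum of weight * score over the weighted properties."""
    return sum(weights[prop] * scores[prop] for prop in weights)


def ranking(alternatives, weights):
    """Alternative names ordered from highest to lowest weighted score."""
    return sorted(
        alternatives,
        key=lambda name: weighted_score(alternatives[name], weights),
        reverse=True,
    )


# Invented scores on a 1-10 scale.
ALTERNATIVES = {
    "A": {"performance": 8, "safety": 5, "cost": 6},
    "B": {"performance": 5, "safety": 8, "cost": 6},
}

# Two weightings that each look reasonable on their own.
favor_performance = {"performance": 0.5, "safety": 0.3, "cost": 0.2}
favor_safety = {"performance": 0.3, "safety": 0.5, "cost": 0.2}

print(ranking(ALTERNATIVES, favor_performance))  # A ranks first
print(ranking(ALTERNATIVES, favor_safety))       # B ranks first
```

One way to guard against this, under the same assumptions, is to agree on the weights before scoring any alternative and to check whether the winner changes under modest changes to the weights.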

I was involved in one project that selected a basic design concept for an electric vertical take-off and landing (EVTOL) aircraft, which would be able to take off and land vertically like a helicopter and also be able to fly horizontally at reasonable speed and range. The team enumerated a large number of options, with different numbers and arrangements of motors and wings. They evaluated them on the basic flight characteristics (speed, drag, efficiency) and on safety (such as resilience to motor failure). They eventually chose a particular design that did the best overall on these metrics, and the aircraft worked as expected.

Rationales and other information. Knowing why the concept is what it is can be as important as knowing what the concept is. Someone will need to revisit the concept in the future—to learn enough to make a change in the system, for example. The concept is likely to include the results of some decisions where it is not obvious why the decision was made as it was. The concept is also likely to contain structure whose purpose is not obvious, that took careful thought to work out. In that case, a component A may have some property that is important for component B, which interacts with A, to function correctly. When someone needs to implement or fix component A, they need to know that this property and relationship must be maintained for the system to work correctly.

This can be done by:

Annotating the concept with explanations to document some of this information.
Keeping a copy of the evaluations that have been done on parts of the concept and linking them to the appropriate parts of the concept.
When there has been a decision between alternative approaches, keeping a record of those alternatives and why some were rejected.

Other information is produced while working on the concept. This includes lists of where there are uncertainties and risk, and where specific technologies or off-the-shelf components can be used.

Completion. The work to develop the initial concept is complete when the concept can be shown plausibly to meet the system purpose. The uncertainties and risks discovered should be reasonable—none of them requiring invention of the implausible or miracles. The concept should be documented in a way that team members can read it to get a general understanding of the system they will be building.

Stakeholders review and validate the concept before it can be considered complete. This is a final check that the concept is correct.

The standard for completing a revised concept is higher. A completed revision also meets the system purpose and is readable as a guide to the system. A completed revised concept is also consistent with the way the system is decomposed into components, their specifications, and the further decomposition and design, so that it is an accurate guide.

The system concept should be kept up to date as the system’s design evolves because one of its purposes is as a guide or introduction to the system.

34.5 Artifacts

The concept development effort produces several artifacts: the concept itself, plus analyses, rationales, and initial lists of uncertainties and risks that have been discovered.

34.5.1 System concept

In other words, the concept is for teaching people about the system’s big picture. However it is organized, it must provide the information in a way that the expected audience (both new and experienced team members, stakeholders) understand. The concept must be accurate and complete to meet this aim. If the concept presents information that is different from how the system is actually built, people will be misled and make mistakes. If the concept artifacts omit some necessary information, people will not know there are things that need to be built or understood--again leading to mistakes either in evaluation or system-building.

At the same time, as I have mentioned before, the concept should remain the concept, not the whole design. It should be an introductory guide, not the specification. Doing otherwise usually means that the concept takes longer to finish than is needed, and that people end up deprived of the high-level guide.

There are three ideas that the concept artifacts convey: the system’s scope, its structure, and its behavior (Section 34.4). These are connected, but each needs to be treated fully on its own as well.

The system concept can be recorded in many different forms and media. I have seen it developed as a prose document with some diagrams, as information maintained online in a wiki or systems engineering tool, and sometimes as a few diagrams with explanatory text. Semi-formal notations such as SysML can be helpful for documenting parts of a concept that are best explained graphically. (I have not found a purely graphical approach to work well; a couple of projects I have worked on have tried to use SysML or UML as their primary means of documenting a concept, and the result was not understandable by most people who needed to use it.) Diagrams seem to work better when embedded in a textual framework that guides the reader through the information. The DODAF defines a standardized collection of viewpoints to record aspects of a system concept [DOD10].

A good concept document is often anchored by the “big scary picture”: one graphic that shows the major ideas in the system. The OV-1 diagram in the DODAF standard [DOD10] is one reference example.

Many projects record the concept in a single document. This document should be relatively short—a few tens of pages at the very most. The document is often structured as a reference to the system purpose, followed by the definitions of some major use cases, system scope or boundary, the high-level component breakdown, and finally discussions of how these components work together to support the use cases.

For the example NASA technology demonstration mission, I used a wiki system to document the system concept. The concept had two parts: the high-level system components (spacecraft, launch segment, ground segment) and a timeline for the mission. (I did not document the scope in the concept, because that had been defined in other mission documents already.) The timeline was divided into the mission phases. In each mission phase, there was a description of its purpose and a list of the major steps. Some steps themselves were broken down into timelines showing detailed steps. Each of the system components and timeline steps was written down as a wiki page, and there were extensive cross-links among the pages to help people find related information.

The system components part of the wiki was organized as a basic block diagram of the whole system, including the launch vehicle, spacecraft, and ground systems. Each component then had its own page with more information, with a description of what it would do or key functions it would have.

The timeline part of the wiki had one page for the overall mission phases, from pre-launch through decommissioning. Each phase then had its own page with details, including a cartoon of what would happen, a description, and list of important steps.

Finally, to repeat: the concept document should be short. I have seen many projects try to use the “concept” to work out and record the system specification. One can recognize when this has happened because the document runs to hundreds of pages, includes lots of details, and is usually abandoned shortly after system development begins. Instead, keep the concept as a high-level explanation. Let the details come in the system specification, which will be long, tedious, and written in stylized forms that are not easy for the uninitiated reader to understand.

34.5.2 Analyses

When it’s baselined, the initial concept should be likely feasible, likely mostly complete, and likely to meet the system purpose. I emphasize “likely” because at the beginning of a project that’s the best that can be done. As the project progresses, the concept should be maintained so that it matches the actual design and implementation, at which point it should be known to be feasible, complete, and compliant because the design and implementation are.

Feasibility of behaviors.

Is there a plausible story about how each behavior can work?
Where are there difficult things that might come up?

In the example mission, most mission steps are similar to those in other cubesat missions. However, we identified three potentially difficult activities: ground communication, power management, and crosslink communication.

Ground communication differed from previous missions in that there would be multiple spacecraft operating, and more than one might be in view of the ground antenna at the same time, potentially causing interference.

Power management differed because we had more components on board and could not operate most of them continuously. This interacted with communications.

Crosslink communication was the most difficult. Multiple actions had to go right for crosslink to work: basic ability to close the link, which depended on orbital dynamics; antenna design; and spacecraft attitude control. These depended on being able to point the antenna on both spacecraft so the other spacecraft was in view. That in turn required knowing where the others were, which had to be worked out from ground observations. We didn’t completely solve this problem before the project ran out of funding.

34.5.3 Evidence of compliance

The standard for compliance is that all of the purpose is met, and that the system contains nothing extraneous. The system purpose should already list all of the identified stakeholder needs.

The evidence of compliance shows how the concept meets each of the stakeholder needs in the system purpose. This can be simple, like a table mapping items from the purpose to parts of the concept. Often this is clearer with an explanation of how or why some part of the concept ensures a need is met. Some items in the purpose won’t be addressed directly by the concept, such as workmanship or project management requirements; these should be noted as being addressed outside the concept.

Compliance also means arguing that there is nothing extra in the concept. This can be addressed by a mapping from each part of the concept to the items in the purpose it addresses. Sometimes the mapping won’t be direct; for example, a security need translates into a set of more specific objectives, and the concept addresses those specific objectives.
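Both directions of the mapping can be checked with a simple table. The following is a minimal sketch under stated assumptions: the need identifiers, need texts, concept part names, and the functions `unaddressed_needs` and `untraced_parts` are all hypothetical and invented for illustration. It reports needs that no part of the concept addresses, and parts of the concept that trace to no need.

```python
# Hypothetical stakeholder needs from the system purpose.
NEEDS = {
    "N1": "Operators can command each spacecraft",
    "N2": "Spacecraft can exchange data with each other",
    "N3": "Workmanship standards are followed",
}

# Hypothetical high-level parts of the concept.
CONCEPT_PARTS = ["ground segment", "crosslink radio", "message channels", "star camera"]

# Compliance table: need -> parts of the concept that address it.
TRACES = {
    "N1": ["ground segment", "message channels"],
    "N2": ["crosslink radio", "message channels"],
}

# Needs noted as being addressed outside the concept.
OUTSIDE_CONCEPT = {"N3"}


def unaddressed_needs(needs, traces, outside):
    """Needs with no trace to the concept and no note that they are handled elsewhere."""
    return [n for n in needs if n not in outside and not traces.get(n)]


def untraced_parts(parts, traces):
    """Parts of the concept that no need traces to: candidates for 'something extra'."""
    used = {part for traced in traces.values() for part in traced}
    return [p for p in parts if p not in used]


print(unaddressed_needs(NEEDS, TRACES, OUTSIDE_CONCEPT))  # → []
print(untraced_parts(CONCEPT_PARTS, TRACES))              # → ['star camera']
```

An untraced part is not automatically extraneous. As noted above, the mapping may be indirect, so a result like this is a prompt to write down the argument or to remove the part.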

34.5.4 Rationales

The concept itself documents what the concept is, but not why it is the way it is.

Providing a rationale or explanation documents why choices were made the way they were. When someone comes along later to make changes to the system, they can learn about the original design thoughts and take those into account as they make changes. The rationale also records reasons that may not be apparent to later readers.

The rationale records subtleties in the concept. For example, one part of the concept may be intended to complement another part, and the two parts need to be consistent for the combination to work. This situation is often not obvious from looking at either part, and adding a rationale will make the reader aware of the connection between the two parts.

Rationale is often written in prose, sometimes with diagrams. The information can be attached to parts of the concept, since the rationale is usually related to why some component is the way it is.

34.5.5 Uncertainty and risk

People often find uncertainties and risks while they work out a system concept. Other risks are identified while working out the system’s purpose. This information should not get lost: it should form the basis for planning later in the project.

The concept is complete when it is likely feasible and compliant, meaning that important risks have been identified, and most have been addressed. One needs to track risks and uncertainties to be able to check that the ones that matter have been addressed.
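Tracking can be as simple as a list with two flags per entry. The sketch below is illustrative only: the `Risk` type and `open_important_risks` function are hypothetical, the risk titles loosely echo the example mission, and the flags are invented rather than a record of what happened on that project.

```python
from dataclasses import dataclass


@dataclass
class Risk:
    title: str
    important: bool   # does this risk matter for feasibility or compliance?
    addressed: bool   # does the concept contain a plausible answer to it?


# Hypothetical register; the flags are invented for illustration.
REGISTER = [
    Risk("ground communication interference", important=True, addressed=True),
    Risk("power budget for simultaneous operation", important=True, addressed=True),
    Risk("crosslink antenna pointing", important=True, addressed=False),
    Risk("wiki tooling availability", important=False, addressed=False),
]


def open_important_risks(register):
    """Titles of risks that matter and have not yet been addressed."""
    return [r.title for r in register if r.important and not r.addressed]


print(open_important_risks(REGISTER))  # → ['crosslink antenna pointing']
```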

34.5.6 Configuration management

Because the concept provides a high-level guide to all the rest of the work in the project, people need confidence that they are looking at an accurate version of the concept. The concept will evolve as the project goes on, as stakeholder needs change, and as the team refines the system’s design.

This means that there will be versions of the concept that are in progress, currently baselined, and outdated. A baselined concept should be consistent with the matching system purpose artifacts and corresponding specification artifacts.

A work-in-progress concept should, therefore, get an explicit review and approval before being baselined, and its artifacts should be maintained in a configuration management system that reflects these versions.
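The three kinds of versions and the approval gate can be sketched as a small state model. This is an assumption-laden illustration, not a description of any particular configuration management tool: the `Status` values, the `ConceptHistory` class, and its method names are hypothetical.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    IN_PROGRESS = "in progress"
    BASELINED = "baselined"
    OUTDATED = "outdated"


@dataclass
class ConceptHistory:
    versions: dict = field(default_factory=dict)  # version label -> Status

    def start(self, label):
        """Open a new work-in-progress version."""
        self.versions[label] = Status.IN_PROGRESS

    def baseline(self, label, approved):
        """Baseline a work-in-progress version, but only after approval."""
        if self.versions.get(label) is not Status.IN_PROGRESS:
            raise ValueError(f"{label} is not a work-in-progress version")
        if not approved:
            raise ValueError(f"{label} has not passed review")
        # The previous baseline, if any, becomes outdated.
        for other, status in self.versions.items():
            if status is Status.BASELINED:
                self.versions[other] = Status.OUTDATED
        self.versions[label] = Status.BASELINED

    def current_baseline(self):
        """Label of the version people should be reading, or None."""
        for label, status in self.versions.items():
            if status is Status.BASELINED:
                return label
        return None


history = ConceptHistory()
history.start("v1")
history.baseline("v1", approved=True)
history.start("v2")
print(history.current_baseline())  # → v1 (v2 is still in progress)
history.baseline("v2", approved=True)
print(history.current_baseline())  # → v2
print(history.versions["v1"])      # → Status.OUTDATED
```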

34.5.7 Artifact maintenance

The concept will evolve over the course of a project. There are two reasons the concept might change once baselined.

First, the system purpose can change, and the concept changes to reflect the changed objectives. When a stakeholder requests a change to the system purpose, the process in Section 33.10 applies. That process includes evaluating a change request to decide whether to agree to it or not. Evaluating the request includes evaluating the effect on the system—what technical changes will be required, and the effort or cost involved in making those changes. Updating the concept is the first step in working out how big the change will be. In many cases a change to purpose and concept will be adopted and baselined together.

Second, the system will evolve as the team works into the details. They may find that some high-level design approach does not work as expected, and take a different approach instead. The team may come to understand part of the system better than at the beginning and find different ways to explain the high-level picture. In these cases the team should keep the concept up to date with the actual system so that the concept remains an accurate way to learn about the system.

34.6 Feedback to system purpose

In practice, a team does not work out the system purpose completely then turn attention to the system concept. Instead, they are likely to work on the two together. As they learn about objectives in the system purpose, they add necessary features to an evolving concept.

Information flows the other way, as well. As the team works out concepts for the system, they begin to understand what is possible and what is expensive. The team may find that some stakeholder needs are not feasible: they would require some capability that is formally impossible or that will require significant innovation that couldn’t be done in the customer’s time and budget. (I worked on one project where an executive asserted that the team could develop algorithms well known to be impossible.) This feeds back to work on the system purpose, and should prompt a negotiation with a customer about what is possible.

Developing the concept also leads the team to discover other potential requirements. For example, the stakeholders might initially make a broad statement about security (“we want it to be secure”). Working on the concept can prompt more specific questions to the stakeholders. What kinds of security hazards concern them? What are they willing to do for different levels of protection? These questions lead to updates to the system purpose, which will then match the concept.

34.7 Validating concept

The team will work to ensure that the concept satisfies the system’s purpose, as recorded in system purpose artifacts. The evidence of compliance, discussed above, records how the team believes that the concept matches the purpose.

However, this means that there is one step of indirection between what the stakeholders actually said and what is in the concept. There could have been misunderstanding in recording stakeholder needs, or there could have been problems translating the purpose into concept. For example, on one project, stakeholders used the term “real time” to describe one necessary feature. Unfortunately “real time” has multiple interpretations in different engineering disciplines: performing work interactively as opposed to off-line versus performing work to meet strict deadlines, for example. The system purpose included “real time” in its objectives; different concepts complied with different definitions. It was only when reviewing concepts with the original stakeholder that the ambiguous definition became clear.

Validation comes directly from the stakeholders when they exist. The point is to check that they understand and agree. In cases where a stakeholder does not exist yet, such as for visionary projects (Section 33.3), someone independent of the project acts as a proxy for future stakeholders.

Some stakeholders can review the artifacts that document the concept, as long as they have people on their teams who have the background to understand the media and language used in the documentation. When a concept is presented to a customer as a proposal, they will either read the concept directly or read a version translated to their language.

Other stakeholders will not read the concept directly. They may need to have the concept presented in language that they are comfortable using. Providing the material interactively so that they can ask questions can also help.

I have used three main approaches for validating the concept with those customers: a translated summary, presentations, and acting out scenarios—often in combination.

Summary documents, written using the stakeholders’ language, are more or less equivalent to the concept artifacts except that they are written for the stakeholder to understand. They are meant for the stakeholder to read on their own. Summary documents may leave out detail that a stakeholder will not care about, such as design details that matter for guiding the team but do not define system functions. The summary should, however, call out information that a stakeholder will care about but may not realize until it is pointed out. This might be an implication that the project team has discovered: having function A requires compliance with regulation B, or function A affects how function C can work.

Presentations are like summary documents, in that they are written for the stakeholder to understand and usually leave out details that the stakeholder does not care about. Presentations are interactive, where team members interact with stakeholders to tell them about the concept. The stakeholders can ask questions, and the team members can confirm that the stakeholders understand parts of the material. The team members can add extra explanation when the stakeholder needs it. Bear in mind that a presentation is an action performed by team members; a presentation is not a summary document written as a slide deck that a stakeholder reads on their own.

Acting out scenarios complements presentations. Team members prepare different scenarios that the system would experience; they and stakeholders can role play different system parts. The stakeholder can get an appreciation for situations that they might normally not think of. A colleague and I acted out some scenarios about how two organizations might negotiate a resource access problem. After we acted out a normal scenario, my colleague began to act out ways that one of the organizations could negotiate in bad faith. The stakeholder had had a mental model of organizations always behaving cooperatively; they appreciated how things could go wrong after acting out the scenarios. They then understood why we had features in the system concept to detect “cheating” and incentivize good behavior.

34.8 Reviews and approval

In the reference life cycle, concept development ends with a conceptual design review (Section 28.4). Passing this review means that the team is ready to move onward to system specification (Chapter 36) and design.

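The team is ready to move forward when they and stakeholders have confidence that the concept is feasible, is likely complete, is good compared with alternatives, and meets the system’s purpose.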

Two reviews are needed: an external review by stakeholders and an internal one by the team and others. The stakeholder review validates that the concept can meet their needs. This often requires the reviewers to look at analyses of the concept, not just the concept itself. The internal review checks both compliance with system purpose and feasibility.

A feasible concept is one for which the team has, or can get, the resources and skills to do the work needed. Making this judgement implies that the concept is complete enough at least to name the areas of work involved.

A feasible concept is also one with plausible ways to implement all of the parts of the system. This does not mean necessarily that there are definitely solutions for everything; that will only be determined when the full system design is done. Some parts of the system can have uncertainties, but the concept should include some reason to believe that those uncertainties can be resolved. If the uncertainties are too great, the review may decide to move forward but plan for further exploration or prototyping early in the project.

The concept is likely complete when it includes all the parts needed to satisfy the system purpose. If some stakeholder need requires n behaviors in the system, the concept has all n behaviors, and all the parts needed to implement each behavior are included.
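
The completeness check can be sketched as a small bookkeeping exercise. The following is a minimal illustration, not a prescribed tool: the `Concept` class, the `completeness_gaps` function, and the example need, behaviors, and parts are all hypothetical names chosen for this sketch. It assumes that each stakeholder need has been written down as a set of required behaviors, and that the concept records which parts implement each behavior.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Concept:
    # Hypothetical model: each behavior maps to the parts that implement it.
    behavior_parts: dict[str, set[str]] = field(default_factory=dict)
    # Parts that the concept actually includes.
    parts: set[str] = field(default_factory=set)


def completeness_gaps(needs: dict[str, set[str]], concept: Concept) -> list[str]:
    """Return a description of each gap; an empty list means 'likely complete'."""
    gaps = []
    for need, behaviors in sorted(needs.items()):
        for behavior in sorted(behaviors):
            if behavior not in concept.behavior_parts:
                gaps.append(f"{need}: behavior '{behavior}' missing from concept")
                continue
            # The behavior is present; check that every implementing part is too.
            for part in sorted(concept.behavior_parts[behavior] - concept.parts):
                gaps.append(f"{need}: behavior '{behavior}' needs part '{part}'")
    return gaps


# Illustrative data only: one need requiring two behaviors.
needs = {"communicate with ground": {"downlink telemetry", "uplink commands"}}
concept = Concept(
    behavior_parts={"downlink telemetry": {"radio", "antenna"}},
    parts={"radio"},
)
gaps = completeness_gaps(needs, concept)
for gap in gaps:
    print(gap)
```

In this example the check reports one missing part and one missing behavior. A real review would rely on judgment as well as bookkeeping, since the list of behaviors for each need is itself an estimate at this stage.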

A good concept is one that is better than other concepts that could potentially meet system purpose, and one that addresses competition from others. “Better” has three aspects. First, one concept is better than another at meeting system purpose if it can meet more stakeholder needs than the other, or has more flexibility to adapt if stakeholder needs change. Second, concepts can be compared on design esthetics that are proxies for desirable system properties like understandability, flexibility for change, or customizability. These esthetics include modularity that encapsulates concerns or the use of well-understood design patterns. Third, a good concept includes little or nothing extra that does not clearly support the system purpose.

A concept that passes its review meets the system’s purpose, meaning that it can satisfy all the stakeholder needs identified in the purpose. As noted earlier (Section 34.6), the initial list of stakeholder needs may or may not be possible, and different approaches might address more or fewer of those needs. By the time the concept gets to review, the system purpose should be revised so that the concept can support all of the needs in the purpose. Satisfying the system purpose at this review should include the stakeholders validating the concept, not just checking that the concept checks all the boxes of the purpose artifacts.

This review, like all reviews, is done by people independent of the main project in order to detect blind spots or biases that the team may have developed. The reviewers can include stakeholders when validation is part of this review; if the validation has already been done, stakeholders do not need to be part of the review team.

The conceptual design review is often a time when a project decides whether to continue with development or to stop work on the system: the go/no-go decision. This is a point when the project evaluates the feasibility of building the system and the cost-benefit tradeoff in the work. If the uncertainty in building the system or the cost is too high, then the team decides not to invest more time and resources on the project and turns to some other work.

RFP-driven projects. In an RFP-driven project, where the team develops a proposal for a customer in order to get approval and resources for development, a proposal development phase follows concept development. The conceptual design review determines whether the team has a concept that is good enough for them to proceed on to writing the proposal.

The concept review is also the point at which a team decides whether to pursue completing and submitting a proposal. This decision depends on:

Whether the team has the resources to pursue the work,
Whether the team has a reasonable chance of winning a contract, and
Whether building the system would be worthwhile.

Determining whether the team has resources requires estimating the resources needed. For the first steps of purpose and concept development, this may be small, perhaps one or a handful of people to gather information and to get an initial understanding of what the customer wants. As the work progresses, more resources will be needed—to gather more information, to do concept development, to gather competitive market data. At each step of the process, it will become clearer how many people or other resources are needed for the next step of developing the proposal. At the same time, the team must be able to estimate how much resource will be needed to build the system if they win a contract. This will be unknown to start, but as the system concept and architecture work move forward the estimates will improve. The team must develop the architecture enough to be able to determine prices to charge the customer and to be able to determine if the team will have the capacity to do the work. These analyses grow out of the concept and later architecture documents.

Determining whether the team has a reasonable chance of winning is a combination of knowing how the customer will judge proposals, how strong other teams are likely to be, and how well this team can satisfy the customer. This information is gathered in the customer definition document, the competition document, and in how the concept and architecture respond to the customer’s objectives.

Finally, determining whether building the system can be worthwhile depends on the needs of the organization and funder. Does the organization require a particular profit margin? Is there a minimum or maximum contract price that is considered “interesting”? Does the system fit within the organization’s business strategy? These kinds of questions are captured in the objectives of the organization or funder, and analyses use these objectives, concept, and architecture documents to develop an answer to them.

Developing proposals is a complex specialty, and much has been written about it. We refer the reader to that literature for further reading.

34.9 Changing the concept

Changing the concept can be an expensive and error-prone task, but it must be done sometimes either to reflect changes in system purpose (Section 33.10) or to respond to feedback as the team develops the system (Section 34.5.7). A change can be expensive because the concept sets the pattern for the structure of all the components that make up the system. One change to the concept can ripple through many parts of the system, potentially affecting a great many components. On the other hand, if purpose has actually changed or a problem has been found, the concept does need to be changed and the system design adjusted to match.

The team must follow a careful process to make changes to the concept after it has been baselined. Some amount of work will have been done on specification and design; it may have progressed even into implementation. The team needs to trace out the effects of the concept change on all of the specification or design work that has been done, and then adjust those accordingly. This can result in some components being dropped and others added. It can change the functional specifications of other components. If the change happens late in the project, when parts of the system have been implemented and verified, the changes may propagate all the way to updated software, hardware designs, and test cases. If the team misses some part of the system, parts of the system might not be consistent with each other and not work correctly together.
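
Tracing the effects of a concept change amounts to walking the links from the concept to everything derived from it. The sketch below assumes the project keeps a record of which artifact was derived from which. The `derived_from` mapping, the `affected_artifacts` function, and the artifact names are hypothetical and serve only as illustration.

```python
from collections import deque


def affected_artifacts(derived_from: dict[str, list[str]], changed: str) -> list[str]:
    """Breadth-first walk over 'derived from' links, starting at the changed item."""
    # Invert the record: for each source, list the artifacts derived from it.
    dependents: dict[str, list[str]] = {}
    for artifact, sources in derived_from.items():
        for source in sources:
            dependents.setdefault(source, []).append(artifact)

    seen = {changed}
    order = []
    queue = deque([changed])
    while queue:
        current = queue.popleft()
        for artifact in dependents.get(current, []):
            if artifact not in seen:
                seen.add(artifact)
                order.append(artifact)
                queue.append(artifact)
    return order


# Illustrative traceability record: artifact -> what it was derived from.
trace = {
    "spec:comms": ["concept:ground-link"],
    "design:radio": ["spec:comms"],
    "test:radio-range": ["design:radio"],
    "spec:power": ["concept:power-budget"],
}
print(affected_artifacts(trace, "concept:ground-link"))
```

The result lists artifacts nearest the change first, which is roughly the order in which the team would revisit them. The walk is only as good as the recorded links: an artifact with no recorded derivation will be missed, which is the failure described above.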

In a couple of projects, I watched people get confused as the target customer needs and concept changed. People did not catch when some important system concept changed and they continued to design and build to a concept that no longer applied. This disconnect led in turn to hard-to-find system flaws: people designed accurately to the wrong objective, and sometimes it was a long time before someone caught the mismatch. This led to error-prone redesigns of parts of the systems and extra cost.

The implication is that changing the system concept is both a technical matter and a team communication matter. A well-functioning team communicates clearly when the concept changes and provides safety net mechanisms to catch when someone has missed a concept change.

Changes to the concept should begin as tentative works in progress, distinct from any baselined version of the concept. To baseline a concept change, the new version must be reviewed to ensure that it meets the review criteria identified above: feasibility, completeness, compliance. If the changes are extensive, the change review can require nearly as much effort as the original concept review. If the changes are confined to just part of the concept, the review can often be limited to only those parts of the concept affected by the change. Nonetheless, the updated concept should pass the review before being baselined.

The initial concept only needs to be likely feasible, complete, and compliant. As the project development continues, the bar of likelihood rises; by the end of the project, the concept must be fully feasible, complete, and compliant.

Not every request to change the concept will be granted. As with changing the system’s purpose (Section 33.10), the team should evaluate the effects of a concept change before committing to it. Indeed, a change to purpose and change to concept may go together: someone might propose a change to the stakeholder needs the system will address, and as part of evaluating whether to make that purpose change the team develops an update to the system concept in order to estimate how costly the change will be. The changes to purpose and concept may then be accepted and baselined together. On the other hand, if the updated concept shows that the change is not reasonable, then both the changes to purpose and concept may be rejected together. Because of this, it is important to keep tentative, work-in-progress versions of the concept separate from baselined versions.

34.10 Using the concept

34.10.1 Concepts in a visionary project

I defined a visionary project in Section 33.3 as one that does not yet have a specific defined customer or market segment.

Ideally, a visionary project works out potential market segments (sets of potential customers), and investigates those to determine likely wants and needs. The system concept is designed to satisfy this likely system purpose.

Customers often want a great many things that can’t reasonably be satisfied in one system, at reasonable time and cost. Developing a concept is an opportunity to see which needs can be satisfied, and at what cost. Two concepts can be compared by how many customers each concept might satisfy well enough, and thus roughly which approach will be more popular. Concepts can be evolved to reach a cost-market size tradeoff that the project considers best.

A system concept that is aimed at a hypothetical customer is likely to need to change as the project learns more about their customers and narrows down their needs. As noted earlier, the process of deciding to commit to a new concept must be handled carefully so that the entire team works in concert with the changes.

I have joined some projects that decided to begin design and implementation before working out the concept. In one case they did this in order to satisfy potential funders by showing some kind of progress; in another case, most of the people involved were specialists who wanted to get working right away on their part of the system. In both cases the team designed and started building components that did not meet customer needs as those were identified. Both teams established a culture of working based on narrow, incomplete understanding, and they only identified and fixed systemic problems with unhappy team upheaval and significant cost. They both had trouble meeting actual customer needs because they did not connect the work on individual components to the system structure and purpose.

34.10.2 Concepts in an RFP-driven project

An RFP-driven project is one where a potential customer issues a request for proposals, and the team develops a proposal in response. The customer request lists their needs and hence defines the system purpose, though the team often will work with the customer to clarify the request.

The project responds to the request with a proposal. The proposal defines what the team will produce, shows that it meets the customer needs, and estimates the cost and time involved. The customer uses these proposals to choose which team will get a contract to design and implement the system.

The proposal includes the system concept: the concept defines the system that the customer will get, at the high level.

Because the proposal also includes cost and time estimates, the team usually has to go further than the concept itself and include some high-level design. This design improves the basis for estimating cost and time. Those estimates need greater certainty than what can be worked out from the concept alone.

The team also decides at some point whether to proceed to develop the proposal, and whether to submit a proposal. These decisions depend on whether the team has a viable concept or not, whether the team has the skills and resources to build to that concept, whether the concept will meet customer needs, and whether the project will meet other stakeholder needs, such as profitability. These go/no-go decisions use analyses done on the system concept.

34.10.3 Incremental growth and descope options

A system concept does not have to be all or nothing. A project can choose to design and build a system in phases, starting from something simple and adding capabilities over time.

As noted in Section 33.6, the stakeholders may have asked for more than is achievable with the time or funding available. Concept development is the time when the team does the work that can reveal that the stakeholders are asking for something too complex, including what the complicated parts are and why they will be hard to build.

One way to handle this is to plan to build the system in multiple steps. Choosing a concept that grows over time is a good idea when there is value in getting something in front of a customer quickly and getting their feedback. Planning to grow over time can also address the need to satisfy potential funders who want to see progress and market acceptance before committing additional investment.

A project can also do the opposite: define a system concept, and plan potential ways to descope or remove capabilities if the work proceeds more slowly or takes more resources than expected.

It is worth doing this planning early, rather than sometime later in the project when there is an emergency rush to fix a problem.

Finding options for growing or shrinking a project over time depends on three things: importance, viability, and dependencies among features. Importance involves ranking the parts of the purpose (needs) to find those that are least important and that can be removed from the plan while leaving the most remaining value. Viability is whether the system will be useful with those parts removed. Dependencies are the capabilities that other features rely on, and that must therefore be in place before those features can be added.

In the example cubesat system, we had a request to include a capability to control spacecraft attitude to manage atmospheric drag, especially near perigee (the lowest point in the orbit, when atmospheric drag is greatest). This would have been used to try to manipulate how far apart the spacecraft drifted, and to extend or shorten their lifetime on orbit. This capability was not essential to the basic communication and science operations of the mission, and it was removed from the concept and negotiated out of the system scope. In other words, this feature was of low importance and did not affect viability.

Teams often plan a sequence of things that can be removed or added, reducing or adding to the system scope and purpose step by step. I have developed a set of systems engineering tools over the years to help me build systems following the models in this book. That system started as something simple that could manage tables of requirements (Chapter 37). As time went by I have added capabilities one by one to model the component breakdown (Chapter 41), various ways of specifying component behavior and interaction (Section 42.3), and the information exchanged during interactions (Section 42.4). At each step the system has become more useful, but I didn’t wait until everything was perfectly worked out before starting to use it.

Reduce the system capabilities too much, however, and it isn’t useful to a customer. If there are multiple customers, such as for a visionary project that is aiming for some market segment, reducing the system features can reduce the number of customers who will find the system viable. This can lead to an initial version being viable for only a few customers, with the number of customers growing as more features are added in over time. This can be useful, as it gives a team time to build a solid system foundation and validate customer satisfaction before investing in more features.

In one spacecraft system I worked on, project leadership wanted to launch a minimal spacecraft and add to its software capabilities over time. The team was far behind schedule developing their system, and they were looking for any ways they could get more development time while still meeting the launch schedule they had already paid for. The spacecraft would launch with a basic software image that could operate the spacecraft and check it out on orbit, and the team would then upload new science operation or data analysis capabilities over time. This would give the team more time to get the software written and tested before it was needed, and it would allow for adding new capabilities that weren’t imagined when the mission started.

The difficulty they encountered was that incrementally adding new software capabilities depends on some baseline functions: the ability to communicate well enough to send up new software, and the ability to install and run the new functions. The ability to upload new software implies a communication capability that can move significant amounts of data from ground to the spacecraft. Such communication implies a moderately high bandwidth transceiver, antennas to match, the ability to point the antennas at ground stations, and the ability to schedule ground communication. If the team does not build these in before launch, it will be nearly impossible to add them later. In other words, adding all the new functions while on orbit depended on significant capabilities being available from the time of launch.
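
The dependency reasoning in this example can be sketched as a closure over a "requires" relation. This is a minimal illustration under stated assumptions: the `launch_baseline` function and the `requires` mapping are hypothetical, and the feature names simply restate the capabilities listed above rather than the actual design of that spacecraft.

```python
def launch_baseline(requires: dict[str, list[str]], added_later: list[str]) -> set[str]:
    """Everything that later features depend on, directly or indirectly."""
    needed: set[str] = set()
    stack = list(added_later)
    while stack:
        feature = stack.pop()
        for prerequisite in requires.get(feature, ()):
            if prerequisite not in needed:
                needed.add(prerequisite)
                stack.append(prerequisite)
    # Features planned for later are not themselves part of the baseline.
    return needed - set(added_later)


# Illustrative dependencies, paraphrasing the capabilities named in the text.
requires = {
    "science data analysis": ["software upload"],
    "software upload": [
        "high-bandwidth transceiver",
        "matching antennas",
        "antenna pointing",
        "ground contact scheduling",
        "install and run new software",
    ],
}
baseline = launch_baseline(requires, ["science data analysis"])
for feature in sorted(baseline):
    print(feature)
```

Running a check like this while planning growth or descope options shows how much has to be present in the first version. Here a single deferred feature pulls six capabilities into the launch baseline.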

34.10.4 Relation to other lifecycle patterns

The concept development work described in this chapter includes steps from both the NASA Pre-Phase A and Phase A project phases. Pre-Phase A includes developing the concept of operations, while Phase A includes developing the mission architecture. These phases of the NASA lifecycle include many other steps, such as developing various management plans or identifying stakeholders and mission needs.

Chapter 35: Proposals

35.0.1 For RFP-driven projects

An RFP-driven project is one where a customer is asking for proposals from development teams about how they will design and build a system. The customer is usually asking for multiple, competing teams; the customer will choose one or more teams for a contract to build the system.

The customer writes a request for proposals (RFP) document that defines both the characteristics of the desired system and how the customer expects to judge between multiple competing proposals, if there are any. The RFP should thus document the customer’s objectives. In many competitive acquisition cases, the RFP must be the only official source that a team can have so that all proposing teams work from the same information—thus treating all teams equally.

When deciding to respond to an RFP, the team must learn what acquisition rules the (potential) customer is using in order to determine what restrictions to follow when communicating with the client. The team must also learn how the customer makes decisions, including who makes the decisions, who influences the decisions, and how the decision will be made. When responding to a commercial RFP, this can be easy: there is a contact who sends out the RFP and who can answer questions as needed, there is someone they work for who reviews and decides whether to accept a proposal or not, and the decision is based on what the decision-maker thinks meets their needs at the best price. For a US Government agency RFP, on the other hand, the decision process is defined by Federal Acquisition Regulations and by the agency’s supplemental rules. There are formal processes for submitting questions; there is typically a defined scoring and weighting system that a formal review team must use to rate each proposal.

The information gathered about how the customer communicates and makes decisions should be included in the Customer Definition document.

Whether one can get clarifying information or not, the concept documents should include documentation on where the team has made assumptions or interpretations of the RFP source material. These points are matters where there is greater than usual risk that the team’s assumption does not match what the customer is thinking. This means that there is a higher than usual risk that the concept or design that the team proposes will be interpreted differently than what the team means—and so it is worth putting extra effort into making those parts of the proposed concept or design as clear as possible.

There are two end results of the process for responding to an RFP: first, a decision whether to complete and submit a proposal, and second, submitting a proposal if the first decision is positive.

35.0.2 Proposal (RFP-driven projects)

Purpose. A proposal, in the sense meant here, is a document that is sent to a potential customer in response to a request for proposal (RFP) that the potential customer has issued.

A proposal needs to make four cases to the customer:

That the team has a technical solution that meets the customer’s needs
That the team has the capability to actually produce the system
That the price for the system will be acceptable
That the team has a better offering than its competitors, if any

The proposal derives from the work done during concept development, but usually also must include initial system specification and design work. This initial technical work is needed both to be able to explain to the customer what they would be getting if they choose this team to develop the system, and to generate a reasonable price for building the system.

Many processes and guidelines for proposal development have been published over the years, and we refer the reader to that large body of literature for details.

Chapter 36: Specifications

36.1 Purpose

Specification is about recording how a component (or system) should behave or the structure that the component should present. It only documents how the component appears from the outside, as a black box; it does not specify how the component achieves these ends. A specification derives from the less-formal concept for the system or component.

A specification provides a simplified and abstract view of a component. This abstract view allows one to reason about how the component will work with other components. Without the abstract view, one would have to analyze the details of a component’s implementation to determine whether it will interact properly with another. While that is possible, the work of figuring out how the component will behave only serves to reconstruct design information that was originally worked out when designing the component. The reconstructed information will not necessarily match the information used during design, and the effort is wasteful.

A good specification records the intent and assumptions that went into working out what the component is supposed to do. This information helps the component’s implementer and designer to check that they understand what they need to build, and to check that the specification matches the intent. These assumptions also help people understand how a component might need to change when part of the system is redesigned—to add a new feature, for example. A record of the intentions helps people who come along later to understand the system, and the particular component’s role in it.

Finally, a specification serves as a sort of contract between a component and the rest of the system in which it functions. The people building the component in question can proceed to work on their component with confidence that the result will likely integrate correctly into the system as long as they build to that contract. The people building other parts of the system can likewise proceed with reasonable confidence that when they go to use the component, it will do what they expect.

36.1.1 Good specification properties

A specification is used for several different tasks by different people over the course of a project. A good specification needs to be structured and contain the information needed to support these people.

Specifications should be clear and unambiguous. Each of the people who will read and use a specification needs to arrive at its intended meaning.

They should be testable. Someone using the specification should be able to look at a design or implementation and determine whether it is compliant with the specification. That does not mean that determining compliance is easy; it only means that it is possible. Sometimes the most that is possible is to build a body of evidence that a design is very likely compliant. For a specification to be testable, however, it can’t contain words like “approximately”, “fast”, or “heavy”; it needs specific values that define what “approximately” (“+/- 10%”), “fast” (“at least 20 m/s”), or “heavy” (“greater than 5 kilograms”) mean, so that compliance is not a matter of subjective judgment that can differ between two people.
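
One way to see the difference is to write each criterion as a check that returns the same answer no matter who runs it. The sketch below is illustrative only: the `Criterion` class and `within_tolerance` helper are hypothetical, the thresholds are the example values from the paragraph above, and the nominal value of 100.0 is an assumption added so that the tolerance has something to apply to.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Criterion:
    text: str                       # the precise wording used in the specification
    check: Callable[[float], bool]  # objective test applied to a measured value


def within_tolerance(nominal: float, fraction: float) -> Callable[[float], bool]:
    """Build a check for 'within +/- fraction of nominal'."""
    return lambda measured: abs(measured - nominal) <= fraction * abs(nominal)


# Each subjective word is replaced by a specific, checkable statement.
criteria = {
    "approximately": Criterion(
        "within +/- 10% of the nominal value 100.0",  # nominal value is assumed
        within_tolerance(100.0, 0.10),
    ),
    "fast": Criterion("at least 20 m/s", lambda speed: speed >= 20.0),
    "heavy": Criterion("greater than 5 kilograms", lambda mass: mass > 5.0),
}

print(criteria["fast"].check(22.5))   # a measured speed of 22.5 m/s
print(criteria["heavy"].check(5.0))   # exactly 5 kg is not 'greater than 5 kg'
```

Writing the check forces decisions that vague wording hides, such as whether a boundary value counts as compliant.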

The specifications need to be organized. A specification is no good if the people who need to use it don’t know it exists or can’t find it. A specification is also not useful if the people who need it can’t tell whether it is currently applicable, outdated, or a speculative proposal. Specifications should be kept in one place where everyone on the project can find all of them, and they should be maintained under configuration management.

A good specification is minimal. It addresses the needs for the system or component that have been identified in the concept work leading up to the specification, but it does not add other elements that are not relevant to the identified needs. (Note, however, that the process of developing a specification can often reveal needs that were missed in building up the concept and CONOPS. When those gaps are found, the concept and CONOPS need to be updated as well as addressing the gap in the specification.)

36.1.2 Specification versus documentation

Specification and documentation play different roles. Specification is a record of what something should be, while documentation is a record of what it has been designed and implemented to actually be. Specification deals with the black-box, external behavior, while documentation deals with the internals of the component. The documentation should connect decisions about the component’s internal structure to the external behavior or structure documented in the specification.

36.1.3 Specification needed to scale a project

A small project, implemented by a very small group of people over a short time and thereafter left alone, and that does not provide safety- or security-critical functions, does not necessarily need specification.

Unless all of those conditions hold, some level of specification is necessary in order to communicate between people and across time.

Sidebar: The role of experience substituting for specification

Every specification is written in terms of some level of common knowledge: language, jargon, or subject matter. When writing a specification, one strikes a balance between what is assumed and what is explicitly recorded. Formally, it is not possible to truly fully specify something because that specification is always based on some amount of shared axiomatic knowledge. Such formal specifications also become difficult to understand as the detail overwhelms clarity. At the same time, assuming too much leaves room for misunderstanding.

In small and fast-moving teams, there is a temptation to rely on experience rather than writing down needs, especially when the same person specifies and implements a component. This can work in the short term—but not in the long term, as people change or other people start to share the work. Leaving needs implicit rather than documenting them also disadvantages early career engineers: they do not yet have the experience that would fill in the gaps, and if they cannot do some guided design or implementation work they will not get the experience they need to do more on their own later.

Leaving needs implicit can be okay if it is a transient condition, and needs, specifications, and assumptions are recorded before they are forgotten.

36.2 Specifications and systems

A specification defines the metaphorical shape that the component should have in order to fit into and support the system.

A specification treats the component as a black box: it considers only how the component should be seen from the outside, without determining how the component’s internals should be designed or implemented. One way to look at the specification is that it defines a contract between the system and the component: if the component behaves according to the specification, the system should work correctly as a whole.

A specification may define behaviors or attributes that in effect narrow the range of possible designs, possibly to only a single design. That situation in itself does not make a specification invalid. However, the specification should not include definitions whose sole purpose is to constrain the design and that are not strictly needed to record required external behaviors.

After a component has been specified, design of the internals of that component begins. The internal design often uses sub-components. The designers will develop specifications for the sub-components.

This process repeats recursively to lower and lower components, until one reaches components that have no further sub-components. The result is a tree (or possibly a DAG) consisting of alternating layers of specifications and designs. (This has been called the “layer cake model”.) The design of one component (or the system) responds to its specification. The specification for subcomponents depends on the design that has been selected for the component—the design determines both what subcomponents there are, and how they are to work together.
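
The alternating layers can be pictured with a small recursive data structure. This is a sketch for illustration, not a tool from any particular project: the `Component` class, the `layer_cake` function, and the component names are hypothetical, and it assumes a simple tree rather than a DAG.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Iterator


@dataclass
class Component:
    name: str
    specification: str  # black-box view: what the component must be or do
    design: str         # internal view: how it meets its specification
    subcomponents: list[Component] = field(default_factory=list)


def layer_cake(component: Component, depth: int = 0) -> Iterator[tuple[int, str]]:
    """Yield (layer, label) pairs; even layers are specifications, odd are designs."""
    yield 2 * depth, f"specification of {component.name}"
    yield 2 * depth + 1, f"design of {component.name}"
    # The design determines which sub-components exist and how they cooperate.
    for sub in component.subcomponents:
        yield from layer_cake(sub, depth + 1)


# Illustrative tree: a system whose design uses two sub-components.
system = Component(
    "system", "system specification", "system design",
    [
        Component("component A", "specification A", "design A"),
        Component("component B", "specification B", "design B"),
    ],
)
cake = list(layer_cake(system))
for layer, label in cake:
    print("  " * layer + label)
```

The indentation in the printed output shows the alternation: each specification sits one layer above the design that responds to it, and each design sits one layer above the specifications of its sub-components.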

36.3 Example

Some years ago, I worked on a rack-mounted computing system that had high reliability and uptime goals. A decision was taken to include a battery pack in each server assembly, so that if the mains power went out the servers would have enough time to record their state on storage before shutting down.

Consider the specification for the battery pack. It may seem simple—provide enough power to run the server assembly for some period of time—but the actual specification contains several subtle elements because its function is entwined with other system-wide reliability and safety behaviors.

Here are some of the system behaviors that affect the specifications for the battery pack:

These are rough objectives for the server assembly as a whole. These translate into specifications on the battery pack itself.

These example objectives are not all of what would be needed for a server battery pack, but they illustrate several of the kinds of concerns that the battery pack’s designers will need to consider. These rough objectives must be turned into more precise specifications in order to guide the designers accurately. For example, some of the statements above use subjective words like “nominally” that need to be made precise. Other statements are too general and need to be decomposed into a set of more specific statements.

36.4 Kinds of specifications

“Specification” is a deliberately broad term, encompassing many different ways of recording what something should be or do (and why).

Many people assume that “specification” means “requirements”. While requirements are one kind of specification, they are not the only one—and requirements are not generally sufficient by themselves to record all the information needed about behavior or structure.

36.4.1 Combining multiple kinds of specification

In practice I have found that no one kind of specification meets all needs, and have used multiple kinds of specification together.

Generally, each kind of specification we use meets the good specification objectives of being clear and testable, as defined earlier.

Mixing multiple kinds of specification, however, requires care in organizing the specifications. Different kinds are often written and stored in different tools (a tabular tool for requirements; a CAD tool for mechanical drawings). This easily leads to a situation where a practitioner cannot find all of the specifications to which they need to be paying attention.

One way we have addressed this is to use a table of textual requirement statements as a primary specification, and include requirements like “the component shall comply with state machine X”, including a reference to the drawing of the state machine. Using a tool that makes all these forms accessible through one common user interface helps make this convenient for users. Using tools that can perform configuration management across all the different forms of specification also helps.

36.5 Using specifications in a system

We first look at how specifications are developed and used from the outside: from the perspective of those who are concerned with how a component fits into the system, and not with what the specification means for the design internal to a component.

A specification for a system derives from the objectives and CONOPS developed during the system concept development phase.

The system-level specification leads, in turn, to a system design and then recursively to the concepts and specifications for components in the system.

36.5.1 Building the specification

This is the first step in using specifications. The specification developer looks through all of the conceptual material assembled for the system or for a component, and organizes and formalizes it to make a specification.

In practice this does not happen all at once. People develop the various kinds of objectives that lead to the specification iteratively, and parts of the specification will be developed as the objectives and concept become clear. As people develop the specification, they will identify gaps in the concept, which will lead to improvements in the objectives and CONOPS and in turn lead to updates to the specification.

36.5.2 Evolving the specification

The needs that a system solves change over time. New capabilities get requested. Regulations evolve. Problems with the system are found and need to be fixed. All of these can lead to changes in the concept and thus to changes in the system specification.

The concept and design of components also change, and for similar reasons. As well, a component may have a perfectly adequate design, but it may become outdated because subcomponents become unavailable. This leads to a redesign of the component, inducing new specifications for subcomponents.

It is important to follow an organized process when a specification changes. Many process standards recommend specific approaches; for example, ISO 26262 [ISO26262] specifies that any change to a system must begin with an impact analysis, which determines how a change to objectives or specification propagates through the design of the system, and downward through the hierarchy of components. Standards like that also specify that the specifications and designs be maintained under configuration control so that everyone can know whether a change is a work-in-progress proposal or has been committed to.
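
The propagation step of an impact analysis can be sketched as a graph traversal over derivation links. This is only an illustration under assumed names, not a procedure taken from ISO 26262: the artifact identifiers and the `derives` mapping are hypothetical.

```python
from collections import deque

# Hypothetical derivation links: each artifact maps to the artifacts
# derived from it (objective -> spec -> design -> sub-spec ...).
derives = {
    "objective:uptime": ["spec:system"],
    "spec:system": ["design:system"],
    "design:system": ["spec:server", "spec:rack"],
    "spec:server": ["design:server"],
    "design:server": ["spec:battery"],
    "spec:rack": [],
    "spec:battery": [],
}

def impacted(changed, derives):
    """Return every artifact reachable downstream of the changed one."""
    seen = set()
    queue = deque([changed])
    while queue:
        current = queue.popleft()
        for child in derives.get(current, []):
            if child not in seen:
                seen.add(child)
                queue.append(child)
    return seen
```

Calling `impacted("spec:server", derives)` yields the server design and the battery specification, which are the artifacts someone would need to re-examine before the change is committed.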

36.5.3 Validating the specification

The specification must reflect all of the needs identified in the concept from which it derives, and the specification must not add needs that do not appear in the concept and objectives. Before a specification can be declared complete, someone must go through all the material in the concept to check that the specification accurately reflects each of the identified needs or objectives.

A specification validation exercise can also help identify gaps in the objectives. Checking the specification often involves someone who was not part of developing the objectives and CONOPS; a fresh perspective can lead to asking questions about the objectives or the specifications that in turn lead to discoveries of topics that are missing.

36.5.4 System consistency

As the system design grows and more and more components are defined and specified, someone needs to check that the designs and specifications are all consistent. This is especially important for “long distance” dependencies: where the correct function of one component depends on the correct function of another component in a different part of the system. (More formally, when two components A and B depend on each other for correct function, and the lowest common parent of A and B in the component hierarchy is near the top of the hierarchy.)
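
One way to flag long-distance dependencies is to compute the lowest common parent of the two components and look at how close it sits to the top of the hierarchy. The sketch below assumes the hierarchy is stored as a child-to-parent mapping; the component names are hypothetical.

```python
# Hypothetical component hierarchy: child -> parent (the root has parent None).
parent = {
    "system": None,
    "power": "system",
    "compute": "system",
    "battery": "power",
    "charger": "power",
    "server": "compute",
}

def ancestors(component, parent):
    """Path from a component up to the root, starting with the component."""
    path = []
    while component is not None:
        path.append(component)
        component = parent[component]
    return path

def lowest_common_parent(a, b, parent):
    """Deepest component containing both a and b (may be a or b itself)."""
    above_a = set(ancestors(a, parent))
    for candidate in ancestors(b, parent):
        if candidate in above_a:
            return candidate
    raise ValueError("components are not in the same hierarchy")

def depth(component, parent):
    """Number of levels below the root (the root has depth 0)."""
    return len(ancestors(component, parent)) - 1
```

Here the battery and charger meet at `power` (depth 1), while the battery and server meet only at `system` (depth 0). A small depth for the common parent suggests a dependency that deserves an explicit consistency check.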

36.5.5 Safety and security design

As we will discuss in future chapters, the safety and security properties of a system must be designed top down, and they need to be defined early in system development, before too many low-level components are designed.

We advocate using the systems safety methodology, which emphasizes starting with the accidents or losses that are to be avoided, and then the conditions that must be maintained in a system to achieve safe operation. (This is different from many safety methodologies, such as functional safety, which focus on safety in the face of failure conditions and do not address safety problems arising from design or component interactions.) The categories of losses come from the safety and security objectives defined in the concept development phase.

Once these conditions are identified, systems engineers must determine how to address them in the design of the top-level system. They must then create derived specifications for each of the top-level components in the system, and show that if each of the components meets its specifications the overall system will exhibit safe or secure behavior by complying with the safety and security conditions. This process is repeated at increasingly lower levels of the system.

36.5.6 Review and approval

A specification guides the design and implementation of parts of the system. Given the importance of this role, a specification—or an update to a specification—should be reviewed before being committed to. Each specification should be checked by the people whose work it affects: system designers, the designer of the component or system that contains the thing being specified, potential implementers, and those people who are working on components that will interface with or use the component being specified.

As with other system artifacts, a specification or specification update should be under configuration management so that each user can determine whether they are using the correct version, and whether that version is a proposed or work-in-progress version, the current approved (baselined) version, or a version that has become obsolete.

36.6 Using specifications for a component

We now turn our attention to the people and activities that use a specification to design and implement a component; that is, to those who are concerned with how the internals of a component reflect its specification.

One track follows the design and implementation of the component itself, which should result in a component that complies with the specification. The other track follows the design and implementation of verification methods, such as tests or static analyses. The tracks come together when the implementation gets checked by the various verification methods, resulting in a determination of whether the implementation is in fact compliant, or whether the design and implementation need to be fixed to bring it to compliance.

36.6.1 Learning about a component

A specification is an abstracted view of what a component should be. That makes it useful as a guide for someone who needs to learn about a component, before diving into the design or implementation of that component.

Someone who is learning about a component—or about the structure of the system across many components—needs to be able to find the relevant specifications. The specifications should be organized to support them:

36.6.2 Designing and implementing to specification

The general task of a designer or implementer is to create a component that complies with its specification. In practice, of course, this is a complex activity.

The designer needs to be able to clearly identify all of the behaviors or capabilities that the component must implement. This implies that the specification must be organized in a way that helps the designer find all of these, and in a way that can serve as a checklist for tracking which features have been satisfied and which have not yet.

As we will discuss further in upcoming chapters, the designer or implementer should be able to identify which aspects of the component have the highest design risk or are the most technically complex. The designer and implementer will often choose to focus on these hard aspects first, before dealing with aspects that are easy to solve. The hard aspects are often candidates for prototyping, in order to determine if a design approach is feasible and can meet the specification. (See XXX for more on prototyping and risk reduction.)

Complex systems and components can benefit from the combination of incremental development and continuous integration. Incremental development involves selecting a few parts of the component’s specification and implementing those, followed by testing. Once those aspects of the component appear sound, the developers perform a second iteration by selecting a few more aspects of the specification and adding them to the design and implementation. Continuous integration, in this context, involves performing integration testing of these partial designs and implementations in a skeleton of the rest of the system. The partial implementation of this component may use mockups of subcomponents, or interact with mockups of peer components in the system. We discuss incremental development and continuous integration more in XXX.

As people work through design and implementation, they are likely to find problems or gaps with the specification. The specification may be ambiguous in some part, or the specification may not define the behavior for some condition. The developers must be able to work with those who defined the specification to sort out these issues. The developers should check the specifications in depth, asking the specifiers questions to check their understanding or to confirm that there are issues. The developers then should work with the specifier to resolve the issues.

The developers should not make an assumption about a gap or ambiguity and move forward without confirming their assumption. The people who wrote the specification are responsible for ensuring that the specifications for different components are consistent and address large-scale safety or security concerns. The behaviors needed to support correct interaction are encoded in the specification. The developers are responsible for implementing components that correctly support these behaviors so that the resulting system works correctly. The developers do not necessarily have the big-picture perspective to make changes to these critical behaviors, and do not necessarily know who else needs to know about an assumption in how a component is defined. The developers need to work collaboratively with those responsible for the specifications so that all the pieces of the system remain consistent and correct, and so that everyone involved shares a common understanding of how the components and system are to function.

A component’s implementation will need to be verified against the component’s specification. People using continuous testing or test-driven development methods have had good results producing correct component implementations efficiently by testing an implementation in small increments as functionality gets added to it. This reduces the risk that the design or implementation has made some fundamental, early mistake that becomes increasingly expensive to correct as more functionality is implemented on top of the erroneous implementation. Performing continuous testing (or verification) requires having verification cases defined and implemented concurrent with the implementation of the corresponding functionality.

Finally, each component design and implementation will need to be reviewed and approved before being accepted as finished. Verifying that the design and implementation comply with the specification is a major part of the review process. The review activities will be much easier if the specification is well organized.

36.6.3 Evolving specifications

As mentioned earlier, a component’s specification will likely change when a system remains in use for a long time. Systems engineers will need to investigate the impact of making a change to a specification before committing to the change.

The component designers and implementers are part of the investigation process. While a systems engineer can look at what will change in how a component interacts with other parts of the system, the component designers and implementers are better positioned to evaluate the effect that a change in specification will have on implementation or verification.

To change a design and implementation in response to a change in specification, the developers need to correctly determine what has changed in the specification. Having a clear mechanism for showing what requirements have been removed, added, or changed, and for showing specifically how other parts of the specification have changed, makes this task possible. In particular, being able to accurately enumerate every change is important; the developer should not have to hunt for subtle changes that may be hidden.
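
As a minimal sketch of such a mechanism, two versions of a requirement set can be compared by requirement ID to enumerate every addition, removal, and change. The IDs and requirement texts below are hypothetical and exist only for illustration.

```python
def diff_requirements(old, new):
    """Compare two versions of a requirement set keyed by requirement ID."""
    added = sorted(new.keys() - old.keys())
    removed = sorted(old.keys() - new.keys())
    changed = sorted(
        rid for rid in old.keys() & new.keys() if old[rid] != new[rid]
    )
    return {"added": added, "removed": removed, "changed": changed}

# Hypothetical requirement IDs and texts, for illustration only.
v1 = {
    "BAT-1": "The battery pack shall power the server for 60 seconds.",
    "BAT-2": "The battery pack shall report its charge level.",
}
v2 = {
    "BAT-1": "The battery pack shall power the server for 90 seconds.",
    "BAT-3": "The battery pack shall report its temperature.",
}

report = diff_requirements(v1, v2)
```

The report lists `BAT-3` as added, `BAT-2` as removed, and `BAT-1` as changed. Because the comparison is by stable ID, a change in wording cannot hide inside an otherwise untouched document.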

The decisions that are encoded in a component’s design include how different parts of the component interact with and depend on each other. When a component’s design is to be changed in response to a change in specification, some parts of the design will be directly affected. For example, a decision to add a new input message to a component directly implies that new message reception and handling functions must be implemented. However, one change can affect other parts of the existing design, and the designer and implementer must find and address all of these effects. The example new input message, for example, might require changes to a database schema for storing additional information, or might affect response time behaviors that require changes to foundational concurrency control capabilities in the design. Having a clear record of how parts within one component are designed to depend on or affect each other reduces the effort involved in making this kind of change, and reduces the chances of an error stemming from some dependency being overlooked.

36.6.4 Verifying a component

The specification defines what a component should be or do; the design and implementation define how it is or does these things. Verification is the process of ensuring that the implementation produces behaviors that match the specification.

Every element of the specification should have a corresponding method for verifying compliance of the implementation. Different aspects of the specification will require different methods: some aspects can be verified by testing, such as showing that given some input A, the component responds with behavior B. Other aspects will require demonstration, such as showing that a physically representative user can see and reach control devices. Some aspects, especially safety and security, can only be verified by analysis or formal methods, such as showing that a component never performs some action identified as unsafe.

Verification methods involve design and implementation, similar to the design and implementation of the component itself.

Designing a verification method involves, first, determining how a specification property can be verified. (Sometimes a property is best verified using more than one approach in parallel.) Once the approach (testing, review, demonstration, or analysis) has been determined, the next step is to design how that specific specification property will be checked. That can involve designing a set of test cases that cover the expected behaviors, or defining a test procedure to evaluate a mechanical component, or defining who will perform a review and what they will look for.

Implementing a verification method turns the design into a specific set of tools and actions that, when used, give a yes-or-no answer to whether the component is compliant.

The verification methods can have errors. Indeed, in some cases the verification of a property can be more complex than the component implementation it is checking. This means that the verification designs and implementation need careful scrutiny to ensure that they are, in fact, checking the specified properties and not something else.

The verification methods also must be complete: if some property is worth specifying, it is worth verifying. The verification designs and implementations need to be checked to ensure that they cover all of the specification. Explicitly recording which parts of the specification any particular verification method checks helps the task of checking completeness.

Finally, it is common for project management to track what portion of a component’s specification has been completed and verified. This can be organized by identifying each property in the specification, and tracking which verification methods check each one. As verifications are done, the project managers can determine which parts of the specification correspond to verification activities that passed.
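
The bookkeeping described above can be sketched as a small coverage report. The property IDs, verification names, and record layout are assumptions made for this example, not a prescribed format.

```python
# Hypothetical property IDs and verification records.
spec_properties = ["P1", "P2", "P3", "P4"]

# Each verification method records which properties it checks and its
# latest result (True = passed, False = failed, None = not yet run).
verifications = [
    {"name": "unit tests", "checks": ["P1", "P2"], "passed": True},
    {"name": "reach demonstration", "checks": ["P3"], "passed": None},
    {"name": "timing analysis", "checks": ["P2"], "passed": False},
]

def coverage_report(spec_properties, verifications):
    """Classify each property as unverified, pending, failed, or passed."""
    report = {}
    for prop in spec_properties:
        results = [v["passed"] for v in verifications if prop in v["checks"]]
        if not results:
            report[prop] = "no verification method"
        elif any(r is False for r in results):
            report[prop] = "failed"
        elif all(r is True for r in results):
            report[prop] = "passed"
        else:
            report[prop] = "pending"
    return report

report = coverage_report(spec_properties, verifications)
```

In this sketch a property counts as passed only when every method that checks it has passed, and a property with no method at all is reported separately, which is the completeness gap the text warns about.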

36.7 Specification artifacts

Specification activities take as input the objectives and CONOPS artifacts that were generated during concept development.

The elements in the specification should include traces that show how each individual part of the specification derives from some part of the objectives or CONOPS, and conversely how each part of the objectives is reflected in the specification.
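
A two-way trace check can be sketched as follows, assuming each specification element records the objectives or CONOPS items it derives from. All identifiers here are hypothetical.

```python
# Hypothetical identifiers. Each specification element lists the
# objectives or CONOPS items it derives from.
traces = {
    "SPEC-1": ["OBJ-1"],
    "SPEC-2": ["OBJ-1", "OBJ-2"],
    "SPEC-3": [],
}
objectives = ["OBJ-1", "OBJ-2", "OBJ-3"]

def check_traces(traces, objectives):
    """Find spec elements with no source and objectives with no coverage."""
    untraced = sorted(spec for spec, sources in traces.items() if not sources)
    covered = {obj for sources in traces.values() for obj in sources}
    uncovered = sorted(obj for obj in objectives if obj not in covered)
    return untraced, uncovered

untraced, uncovered = check_traces(traces, objectives)
```

Here `SPEC-3` has no source in the concept, and `OBJ-3` is not reflected anywhere in the specification. Either finding is a prompt for review rather than an automatic error.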

The specification artifacts should be maintained under configuration management. That means that there should be a common repository that everyone working on the system can use to retrieve (and potentially update) the artifacts. The repository should maintain separate versions of each artifact, and clearly identify which version is the current, baselined version that people should use, which versions are outdated, and which are works in progress.

The configuration management system should support people reviewing a specification, and must support recording when a particular version has been approved to be baselined.
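
The version states described above can be modeled with a small data structure. This is a sketch of the idea only; real configuration management tools have their own models, and the class and function names here are hypothetical.

```python
from dataclasses import dataclass
from enum import Enum

class Status(Enum):
    WORK_IN_PROGRESS = "work in progress"
    BASELINED = "baselined"
    OBSOLETE = "obsolete"

@dataclass
class Version:
    number: int
    status: Status

def approve(versions, number):
    """Baseline one version and mark the previous baseline obsolete."""
    if not any(v.number == number for v in versions):
        raise ValueError(f"no version {number}")
    for v in versions:
        if v.number == number:
            v.status = Status.BASELINED
        elif v.status is Status.BASELINED:
            v.status = Status.OBSOLETE

def current_baseline(versions):
    """Return the baselined version, or None if nothing is baselined."""
    baselined = [v for v in versions if v.status is Status.BASELINED]
    return baselined[0] if baselined else None

versions = [Version(1, Status.BASELINED), Version(2, Status.WORK_IN_PROGRESS)]
approve(versions, 2)
```

After the approval, version 2 is the only baseline and version 1 is marked obsolete, so a user can always tell which version to work from.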

Chapter 37: Requirements

37.1 What are requirements?

Requirements are one kind of specification: they say something about a property that a component or system should have, or a behavior they should exhibit.

A requirement is a specification in the form of a single, declarative textual statement. In the simplest case, a requirement is a statement of the form:

There are many nuances and variations on this basic form, but they are all extensions of this basic idea.

Requirements are written this way in order to maximize the simplicity and clarity of the specification.

Requirements are only one part of the specification for a component or system. They document specific facts about a system’s design, but they do not document the explanation of how that particular design came to be. They do not document the general purpose and scope of a particular component. They do not document complex interaction patterns. These other parts of a specification are documented in other design artifacts that complement requirements.

37.1.1 Why write requirements?

One of the jobs of systems engineering is to ensure that a user or consumer of some artifact (system or component) will be satisfied with the artifact once it is built and deployed.

The specifications for a system or component serve as a way to organize the information about what the user wants, and to organize the process of checking that the final result meets the user’s desires. The specification thus acts as a kind of implicit contract between the end user and the implementers: if the user agrees that the specification properly records their objectives, and the resulting system can be verified to meet the specification, then the implementers have built something that satisfies what the user agreed to. (Whether the user is actually satisfied is a separate matter.)

This means that there are three main uses for requirements (and the rest of specifications):

A systems engineer is typically the keeper of the specifications, responsible for overseeing the writing, changing, and verification of requirements and other specifications.

Requirements—and all specifications—are therefore acts of communication between multiple groups of people with different roles in building the system.

Systems engineers are facilitators and interpreters in this communication between users and implementers. They are responsible for translating information received from users into specifications (including requirements), and for explaining the specifications back to the users for validation. The information from the user is often unstructured and incomplete. It is up to the systems engineer to work with the user to clarify their objectives and ensure that the result accurately reflects the user’s intent. The systems engineer also works to ensure that the specifications are complete. This often involves identifying use cases that the user has not thought of themselves and working with the user to define what behavior the system should have in those other cases.

The systems engineer also facilitates the implementer’s work. The systems engineer develops specifications so that the implementer has a clear guide to what they need to design and build; this requires that the systems engineer provide translation or explanation when the specification does not use the same terms or concepts that the implementers do. The systems engineer is also responsible for ensuring that the final artifact meets the customer’s objectives by overseeing the verification of the implementation against requirements (and other specifications). This involves working with verifiers to ensure that verification methods match the requirements, and checking that all requirements have been verified before the system is declared done.

A systems engineer performs other tasks using requirements, such as checking consistency or completeness. We will discuss these tasks in a later section.

A good requirement must meet several objectives in order to provide accurate communication between all these parties:

These needs lead to conventions about how requirements are written and organized, as we will discuss later.

37.1.2 What are requirements about?

Requirements are a general-purpose way of writing down facts about what something is supposed to be (or not be).

Requirements can apply to just about anything. In a typical system project, they will be used to:

37.1.3 The context for requirements

Most requirements in a system will apply to particular components in the system. The component breakdown structure provides the list of components that requirements can be about.

Requirements are part of more general specifications for the system and its components. The specifications include

The requirements must be consistent with these other parts of a component’s specification.

In the end, requirements are satisfied by the implementation of the components in the system. Being able to trace the connection from a component’s requirements to the pieces of the implementation matters in order to be able to show that the requirements are satisfied.

37.2 A single requirement

A requirement itself is a single statement of one thing that should be true of its subject.

37.2.1 Example

Consider an example of a statement of what the mission manager for a small spacecraft mission wants:

This sentence has a number of problems. It mixes statements together: the mission and the spacecraft, the operating environment and the lifetime. The sentence is not very precise: what is “low Earth orbit”? What does the spacecraft have to do to “operate”? It is unachievable: nobody can guarantee that a spacecraft will function for a particular duration as an absolute guarantee; what if there is an unusual solar flare that fries its electronics?

We can improve the example sentence a bit by splitting it into three requirements statements:

These requirements improve the original statement. First, they split the original so that each requirement is about a single topic (and is written in the subject-mode-property form). Second, they improve two of the requirements by making them more achievable (“95% probability”) and more precise (altitude range given).

These three requirements in themselves are not sufficient. Before the requirements are done being written, for example, there will need to be a definition of what “operate nominally” means. Similarly, the “at least three years” requirement is not enough by itself: three years would be difficult or impossible to meet if the intended environment were the surface of Venus; it would be almost trivially easy if the intended environment were an air-conditioned clean room. Adding more information about the environment is necessary to interpret the three-year condition: for example, what is the expected radiation environment at those altitudes?

The three example requirements are not sufficient in another way: they are high-level and provide the designer of, say, a battery subsystem no guidance about how the battery must be designed so that the spacecraft meets these requirements. The derivation or flow down is the topic of an upcoming section.

37.2.2 Rationale

A well-written requirement is concise. As such, it makes a statement about what a component should do—but the text of the requirement does not capture why the component should do that.

Good requirements should include a rationale statement that documents the thinking that went into choosing to make the requirement. The rationale does not change the requirement; it only adds explanation. The rationale helps those who must come along later, after the requirements are written, to understand or evaluate the requirements. It helps educate other engineers about considerations that may not be obvious. It helps those who later need to revise requirements understand what constraints there may be on the requirement they are changing.

37.3 Multiple requirements

The meaning of a group of requirements is the logical AND of all of them. If there are ten requirements, an implementation complies with the requirements if it complies with all ten of them individually.
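
The conjunction can be shown in a few lines. In this sketch each requirement is represented as a named check on a component description; the requirement names and component fields are hypothetical.

```python
# Hypothetical requirements expressed as named checks on a component.
requirements = {
    "mass under 5 kg": lambda c: c["mass_kg"] < 5,
    "has status LED": lambda c: c["status_led"],
}

def complies(component, requirements):
    """A component complies only if every individual requirement holds."""
    return all(check(component) for check in requirements.values())

light_unit = {"mass_kg": 4, "status_led": True}
heavy_unit = {"mass_kg": 6, "status_led": True}
```

The heavy unit fails overall even though it satisfies one of the two requirements, because a single failed requirement is enough to make the whole set unsatisfied.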

There are two issues to watch out for when there are multiple requirements: contradictions and exclusivity.

Exclusivity: If a collection includes a requirement

A must do X,

it is perfectly reasonable to also have another requirement

A must do Y.

Having both of them means that there are two things that A must do.

The question then arises: if component A also does Z, is that compliant or not? In some cases it is okay if A does Z (it has a feature that isn’t used) and sometimes it is not (if it is important that A only does X and Y and nothing else ever).

The answer is that having requirements about doing X and Y means that the requirements are silent on Z. If the requirements are silent on a topic, that topic is not considered important and it doesn’t matter for compliance. (If the topic is important, it needs to be included in the requirements.)

If it is important that A only does X and Y and nothing else, that needs to be stated explicitly. This can sometimes be written directly into one requirement:

The component must be colored one of red, green, or blue

This can also be written in a general negative form:

The component must not do any activity not listed in these requirements

Explicitly listing the allowed activities is preferable to a “must not” requirement—the negative form is convoluted and easy to misread.
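
The difference between requirements that are silent on Z and requirements that exclude Z can be sketched with sets of activities. The function name and the `exclusive` flag are assumptions introduced for this example.

```python
def activities_comply(actual, required, exclusive=False):
    """Check a component's activities against its requirements.

    required: activities the requirements say the component must do.
    exclusive: True if the requirements explicitly forbid anything else.
    """
    if not required <= actual:
        return False  # a required activity is missing
    if exclusive and actual - required:
        return False  # an unlisted activity is present
    return True

required = {"X", "Y"}
does_xyz = {"X", "Y", "Z"}
```

With the default setting the requirements are silent on Z, so a component that does X, Y, and Z complies. Once exclusivity is stated explicitly, the same component no longer complies.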

37.4 Organizing requirements

Even a moderately-sized system will typically have thousands of requirements. Users need some kind of organization of all those requirements in order to find the requirements they will be working with.

There are three concepts to discuss: organizing by subject, organizing by sections, and hierarchical writing.

37.4.1 Levels of requirements

People use requirements for different purposes. This leads to fundamentally different kinds of requirements.

At the most abstract level, the general product or mission objectives capture what stakeholders want the system to do—its purpose. These almost always start as general, vague statements. The stakeholders, system engineers, and product managers refine these over time into a clearer definition of the system’s purpose. The exercise may or may not result in proper requirements statements, but it is worth treating the results as if they are requirements and showing how the top-level system requirements derive from these objectives.

Projects also have guiding objectives that do not specify the system directly, but instead define policy or standards that the system must adhere to. There are many kinds of policies, including:

It is helpful to organize the product/mission objectives and all the various policies and standards into separate collections, identified by the kind of policy or source of objectives. For example, one can maintain one collection for business policy and a separate one for the quality assurance standard being used to build a system.

The top-level requirements on the system as a whole are part of the formal or semi-formal definition of what the system is to do. These requirements say what the system is and does when looked at from the outside, as a black box. These requirements are best kept separate from the more vague product/mission objectives—the objectives represent desires, while the top-level requirements represent the commitments made for what the system will do. The derivation mapping from objectives to top-level requirements provides a place to record the rationale for why different decisions were made about the commitments in the system, and why the decision was made not to commit to supporting some desires, represented in objectives.

Requirements on lower-level components provide definitions of what the pieces that make up the system must do. These obviously have a different scope than the top-level requirements for the whole system.

37.4.2 Organizing by subject

The first concept is that requirements should be organized by their subject, following the component breakdown structure.

The system objectives are those requirements that apply to the system as a whole. These typically encode the CONOPS for the system, along with requirements derived from the process or design standards.

The rest of the requirements apply to specific components within the system. The component breakdown structure defines what the components are, and gives them names.

Organizing by component is important for proper verification, so that each requirement can be connected to the implementation artifacts that are expected to comply with the requirement, and so that the implementer of some component can properly determine all the requirements they need to adhere to.

37.4.3 Organizing by section

One single component or process/design standard can often have several hundred requirements. Users can find and work with all these requirements more easily if they are organized by topic as well as by subject.

This can be done by creating a set of topic sections within each component. Often these sections are the same for all components. Some sections will be empty where they are not relevant, but having the same organization across all components helps people find what they are looking for.

There is no one recommended set of sections that will apply to every system. The choice of sections is affected by the kind of system or components being developed, as well as by process and design standards. For example, if an automotive project is following the ISO 26262 Functional Safety standards [ISO26262], the Safety Goals and/or Safety Requirements should be collected into one section.

As a starting point, we have used variations on the following set of sections in several projects:

It’s a good idea to work out one or a few section structures that work for your project, then use those sections consistently across all components.

Keep in mind that some requirements will always fit into multiple sections. For example, a requirement may both be about regulatory compliance and define a function the component is supposed to provide. Try to make consistent choices about which section a requirement goes in, but don’t try to make some perfect hierarchical section scheme that would let people avoid making such choices.

37.4.4 Hierarchical versus flat requirements

There are two general structures for organizing requirements on a particular topic:

The flat organization has all requirements within a section be at the same level. Each requirement is independent of the others and can be understood by reading only the text of that requirement.

The hierarchical organization places requirements into an outline, with general requirements and more specific sub-requirements. The sub-requirements must be read and understood in the context of their parent. The sub-requirements provide details, clarification, or limitations on the general parent.

Consider a set of requirements for security on a TCP/IP communication channel. The general requirement is that the communication channel should be authenticated and encrypted. In outline form, this looks like:

Consider requirement 1.1.1, requiring mutual authentication for the communication channel in question. The requirement for mutual authentication must be understood only to apply to communication channel X. There could well be another communication channel, called Y, that does not have the same authentication requirements.

Each of these statements can be read on its own; each statement includes all the necessary qualifications (“the protocol for communication channel X must…”) to identify the scope of its subject without having to refer to other statements.

37.4.5 Requirement identifiers

People use this identifier to refer to the requirement, including using it as a bookmark or link to reference the requirement in other documents. Software check-ins to a repository often use the requirement identifier to indicate what functionality is being added to the repository. Task management systems use requirement identifiers to track the progress on implementing and verifying particular requirements. In general, the requirement identifier enables the integration of requirements management with other tools and tasks.

The identifier must be stable. That is, once a requirement has been given an identifier, that identifier should not change. The text of the requirement can (and will) change, but the identifier remains a stable way to refer to the requirement in documents, email, and other messages without having to track down all the uses of the identifier and change them.

It is good practice for the identifier to convey some information about the requirement. At minimum, the identifier should make it clear what component or body of external requirements the identifier applies to. If one writes requirements hierarchically, then using the number of the requirement in the outline is a good identifier.

Having the identifier carry some information helps the user check that they are referencing the requirement they intended to reference. It also helps the reader to know generally what the writer is talking about, without going into a requirements management system to check.

For many projects, I have used the format <component id>:<hierarchical requirement number> as the identifier. For example, space.eps.panels:3.4.2 for a requirement applying to a spacecraft’s solar panels.

There are requirements management systems that use a universal, flat namespace for identifiers, such as REQ-82763. This is not a good identifier, because it makes it hard to check when one has accidentally mistyped or miscopied the identifier into another document. If one accidentally types REQ-82764 into another document, that other requirement could apply to a completely different component—and the mistake is obscured.
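An identifier that carries the component name can be checked mechanically. The following is a minimal sketch in Python, assuming identifiers of the form <component id>:<hierarchical requirement number>. The function names, the regular expression, and the set of known components are illustrative assumptions, not features of any particular requirements management tool.

```python
import re
from typing import NamedTuple


class RequirementId(NamedTuple):
    component: str  # e.g. "space.eps.panels"
    number: str     # e.g. "3.4.2"


# Assumed syntax: dotted lowercase component path, a colon, a dotted outline number.
_ID_PATTERN = re.compile(r"^([a-z][a-z0-9]*(?:\.[a-z][a-z0-9]*)*):(\d+(?:\.\d+)*)$")


def parse_requirement_id(text: str) -> RequirementId:
    """Split an identifier into its component id and outline number."""
    match = _ID_PATTERN.match(text)
    if match is None:
        raise ValueError(f"not a requirement identifier: {text!r}")
    return RequirementId(match.group(1), match.group(2))


def check_reference(text: str, known_components: set) -> RequirementId:
    """Parse an identifier and confirm that it names a known component."""
    req_id = parse_requirement_id(text)
    if req_id.component not in known_components:
        raise ValueError(f"unknown component in {text!r}")
    return req_id


if __name__ == "__main__":
    components = {"space.eps.panels", "space.adcs.wheels"}  # hypothetical breakdown
    print(check_reference("space.eps.panels:3.4.2", components))
    try:
        # A mistyped component name is caught here.
        check_reference("space.esp.panels:3.4.2", components)
    except ValueError as err:
        print("rejected:", err)
```

A check like this catches a mistyped component name. It cannot catch a mistyped outline number that happens to exist, but the reader can still see which component the writer meant.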

37.5 Writing good requirements

Requirements are a way of communicating between people on a project: between the customer and systems engineers, between those who look at how multiple systems work together and those who implement the pieces, between those who design and those who test. A good requirement is one understood equally well by all the people who use that requirement.

Writing good requirements takes practice, but the following guidelines will help in writing and reading requirements.

37.5.1 General form

The subject is often a component named in the component breakdown structure. It should be named explicitly:

The majority of requirements use either the word “shall” or “must”, depending on the organization and industry. “Shall” indicates an assertion that the statement about the subject is to be true in the implemented system. “Must” expresses the obligation that the statement will be true in the system. In practice the two words mean the same thing when writing requirements.

Writing the predicate is usually the complex part of writing a requirement. In some cases the predicate is simple:

In other cases, the predicate must have conditions added, saying when or under what conditions the predicate applies:

Sometimes the requirement statement is easier to read if the condition clauses are presented in a different, natural order. However, the semantics remain the same: the clause is part of the property statement:
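One way to see that reordering the clauses does not change the meaning is to treat a requirement as structured data and the sentence as a rendering of it. This is a minimal sketch, assuming a requirement has a subject, a mode word, a property, and an optional condition. The class, its field names, and the example wording are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Requirement:
    subject: str                     # component named in the breakdown structure
    mode: str                        # "shall" or "must"
    prop: str                        # the property that is to hold
    condition: Optional[str] = None  # when the property applies, if limited

    def text(self, condition_first: bool = False) -> str:
        """Render the requirement; clause order changes, the content does not."""
        core = f"{self.subject} {self.mode} {self.prop}"
        if self.condition is None:
            return f"The {core}."
        if condition_first:
            lead = self.condition[0].upper() + self.condition[1:]
            return f"{lead}, the {core}."
        return f"The {core} {self.condition}."


if __name__ == "__main__":
    # Hypothetical requirement, for illustration only.
    req = Requirement(
        subject="control system",
        mode="shall",
        prop="enter safe mode",
        condition="when bus voltage drops below the configured threshold",
    )
    print(req.text())
    print(req.text(condition_first=True))
```

Both renderings come from the same four fields, which is the sense in which the condition clause remains part of the property statement.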

37.5.2 Single topic

A requirement should specify a single property of the subject. The examples above all deal with a single property.

There are requirements that may have multiple things in their property statement that still deal only with a single property. For example:

Formally, this requirement deals with a single property: what color the widget may be painted. The color is restricted to a set of three colors—but the property in question is the color.

Note that this requirement is slightly ambiguous: it is not clear whether the widget can be painted only one of those colors, or some mixture of them. This requirement could be improved by either rewriting it as:

37.5.3 Clarity about subject

A good requirement must be clear about what thing it applies to. In general it is best to write down a proper name of the subject—the name of the relevant component in the breakdown structure, for example.

This rule makes for a lot of repetition in requirements. “The control system must X”, “The control system must Y”, “The control system must Z”, and so on. While it means a little more typing, using the component’s name in each requirement means that each requirement can be understood on its own.

37.5.4 Consistent language

Use consistent terms throughout requirements. Always call component X by one name; don’t change it from requirement to requirement. Always call some one function by the same name, so that it’s clear that all the relevant requirements really are talking about the same thing.

Having lists of names or terms helps those who write requirements to use consistent terms, and provides those who read requirements with definitions when they need to confirm what a term refers to. This means:

37.5.5 Plain language

Requirements (and the rest of specifications) may be written by one or a few people, but they will be read by many people. The readers need to understand correctly what the requirements mean. Many of those readers will be learning about the system by reading requirements or other documents, so they won’t enter into reading the requirements with the same context that the systems engineers writing the requirements will have.

This means: don’t get fancy with requirements language. There are some ways that requirements will sound stilted, like the subject-mode-property form. There is some technical jargon that is needed to make the requirement precise. But don’t make the language more complex than it needs to be.

For any words or phrases that do not have a meaning that will be obvious to all your readers, help them out by defining how those words are being used in the specifications. Start with “must” versus “shall” and any other mode words (see Advanced Requirements below). Provide a glossary of the definitions of the rest of the words.

37.5.6 Negative requirements and “only”

Many organizations prohibit requirements that say “shall not”. Negative requirements have their place, but they are tricky to get right. The problems arise with exactly how broad or narrow the requirement actually is.

Consider a component implementation that could do one of three behaviors, A, B, or C.

If the component has a requirement “the component shall do A”, the implementation satisfies the requirement (it does A). That is because the requirement, as written, allows for the implementation to do other behaviors as well.

If the component has a requirement “the component shall only do A”, then the implementation does not satisfy the requirement because the implementation might do other things.

Now consider a requirement such as “the component shall not do D”. The implementation does satisfy the requirement, but not necessarily in a helpful way. The requirement says that the component doesn’t do D, but what should it do? Are behaviors A, B, and C all acceptable? What about behavior E?

In most cases it is clearer to name exactly the behaviors that are required, because that is unambiguous. One can write verification conditions to test exactly what is allowed.
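The three cases can be made concrete by treating an implementation as the set of behaviors it might exhibit. This is a minimal sketch under that assumption. The function names are hypothetical, and “shall only do A” is read here as “does nothing outside A”.

```python
def satisfies_shall(behaviors: set, required: str) -> bool:
    """'shall do X': X is among the behaviors; other behaviors are allowed."""
    return required in behaviors


def satisfies_shall_only(behaviors: set, allowed: set) -> bool:
    """'shall only do X': every behavior falls inside the allowed set."""
    return behaviors <= allowed


def satisfies_shall_not(behaviors: set, forbidden: str) -> bool:
    """'shall not do X': X is absent; nothing is said about anything else."""
    return forbidden not in behaviors


implementation = {"A", "B", "C"}
print(satisfies_shall(implementation, "A"))         # True
print(satisfies_shall_only(implementation, {"A"}))  # False
print(satisfies_shall_not(implementation, "D"))     # True
print(satisfies_shall_not({"E"}, "D"))              # True
```

The last line shows the weakness of the negative form: an implementation that does only behavior E also satisfies “shall not do D”, whether or not E is acceptable.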

Sometimes, however, one should write a negative requirement. If there is some behavior that really, truly must never happen, then writing a “shall not” requirement calls out that important condition, and a verification test can be designed to show that the system will not do the thing it isn’t supposed to. The negative requirement should usually be paired with a positive requirement that says what the system should do instead.

Safety and security properties often require stating a negative requirement, because these properties are fundamentally definitions of things that the system is to be designed not to do. I have not been able to imagine a way to write “a robot may not injure a human being” [Asimov50] as a positive requirement.

Verifying negative requirements is more complex than verifying positive requirements. See Section 14.4.

37.5.7 Avoid “it”

Avoid the word “it” and other non-specific pronouns or modifiers (“they”, “those”, “them”, “its”). Repeat the name of a thing involved in the property, even if that seems repetitive and wordy. An example:

The “it” in the first example is ambiguous: the word could refer to the mode or to the control system.

37.5.8 Avoid impossibly high bars

There are things that we want a system to do. When writing a requirement, it is tempting to write something like

Unfortunately, this three-year required property of the spacecraft is virtually impossible to meet (unless, maybe, the “spacecraft” is a large, inert chunk of rock). A spacecraft has many parts, operates in a difficult environment, and is built by fallible humans.

The problem with this requirement is that it sets a bar that is so high that no real spacecraft can meet it. The requirement does not allow for any off-nominal operation. It doesn’t allow for a spacecraft to have a temporary fault and then recover. It doesn’t allow for debris to impact the spacecraft. In fact, this requirement is met only when the spacecraft is perfect for those three years. Any real spacecraft will fail verification if it has a requirement like this.

This kind of requirement needs to be modified to something more realistic. There are many ways to do that. The NASA Systems Engineering Handbook has the rule that a requirement should specify “tolerances for qualitative/performance values (e.g., less than, greater than or equal to, plus or minus, 3 sigma root sum squares)” [NASA16, Appendix C].

37.5.9 Measurable conditions

The point of a requirement is that someone can determine whether an implementation complies with the statement in the requirement. Operationally, this means that a requirement can be verified (see the section on verification below).

One way to make a requirement measurable is to specify the condition quantitatively. For example, a spacecraft’s battery must be able to store at minimum X milliamp-hours. It’s not hard for a test engineer to see how to create a test to verify that the battery complies.

Other requirements, especially those that specify an action that should be taken under some condition, aren’t quantitative, but instead are measured by observing whether the required action is taken. The verification tests will involve either creating the condition under which the action is to occur or observing that the condition has occurred, and then observing that the required action has been taken. For this kind of requirement to be useful, a test engineer must be able to understand accurately the enabling condition and be able to create or detect that condition. The test engineer must also be able to understand the action that is supposed to occur, and detect that it has occurred. If the enabling condition or action can’t be detected, then the requirement is not readily measurable.

Requirements on low-level components are often easier to make measurable than requirements on high-level components. This is why high-level requirements are often verified by looking at requirements derived from the high-level requirement rather than by trying to construct a verification test directly on the high-level requirement.

37.5.10 Unverifiable conditions

When writing requirements for human-machine interaction or user interfaces, the underlying need is that a user can understand what the system is doing, and give it the right commands so that the system does what the user wants.

How would someone verify that the system as designed or implemented actually meets this objective? The statement is too vague to test directly.

First, one needs to break the objective up into a number of more-specific objectives. This often involves putting together a list of what it means to “understand what the system is doing”. This might involve:

This breakdown is an improvement over the original desired objective, but the conditions are still not verifiable. As we will see in the later section on requirement derivation, these can be turned into high-level requirements that are broken down further. The verification condition on each of these high-level requirements then consists of two steps: first, verifying all of the derived requirements, and second, presenting an argument that satisfying all the derived requirements shows that the high-level requirement is satisfied.

The derived requirements about “perceiving” or “observing” are themselves not verifiable: how does one verify that a person has observed, or can observe, some state of the system? This needs to be broken down into yet further, more specific requirements. For example,

If all these steps are satisfied and work correctly, then the person should be able to see the amount of fuel remaining.

Focus on the last two functions in the chain: that a person can see the indicator and that they can observe the indication. Seeing the indicator can be in turn broken down into further requirements, primarily on the physical structure around the person. For example, some of these might be:

There is some prerequisite information needed to verify these examples. For example, what range of sizes will the users be? In order to check for unobstructed line of sight, one must know where the user’s head will be. What visual acuity or color perception abilities are required of the users? A color blind user will not be able to perceive some color differences that might be used to convey necessary information. What expectations will a user bring to the task? If a user is socially conditioned that green means good and red means bad or stop, using different colors to indicate good or stop will be hard for a user to interpret.

How would one go about verifying these requirements? There are multiple techniques that will help—and usually the techniques must be used together to really check whether a requirement is satisfied. These techniques are a combination of analysis using models and real-world measurement.

The experimental approaches are often the most expensive in time and money, but they are the gold standard for verifying a human interface requirement. Conforming to standards can help address expectations that users will bring to tasks.

In summary, there are several tools for addressing requirements that are too vague or complex to verify:

37.5.11 Detail appropriate to the level

Requirements should be written as a description of what one sees in a component when looking at it from the outside—a black box view. A good requirement does not go into how the feature or behavior is implemented inside the black box.

Put another way, the requirements for a component are documentation of how the component fits into the system around it. If component A is part of a larger component B, the requirements on A document what the implementation of B needs for A to do its part correctly. If components C and D are peers, the requirements document what they will need from each other for both to do their job.

This matter connects directly to requirements derivation from component to subcomponent, which is discussed in the next section.

It is tempting to skip right to the details of how a component is built. Don’t do it; provide other people the benefit of your understanding of the problem, not just the final design answer.

37.6 Requirement derivation

No requirement stands entirely on its own. Almost all requirements have some reason that they have been included in a system, starting with: this requirement is necessary so that the system meets some objective. In lower-level components, the reason often is: this requirement is necessary so that this component provides some feature that other components depend on.

These are examples of requirement derivation. Derivation encodes the relationship between requirements.

Almost all requirements are derived from other requirements, and the requirements in a system must keep track of how one requirement leads to another, or how one is dependent upon another.

37.6.1 Subcomponents providing features for parent

A parent component has a requirement that the component provide some feature. The requirement in the parent specifies what the parent must do, but does not specify how to implement that feature. The design of the parent component, and later, the implementation, document how the parent component will satisfy that requirement.

When the designer decides on the implementation, they will decide (among other things) how the parent component will use subcomponents to implement the feature. These decisions create requirements on the subcomponents so that they provide the features that the parent component will use.

The reason for these requirements on subcomponents is that they are necessary to satisfy the requirement on the parent component. A derivation relationship between the parent requirement and the subcomponent requirements documents why the subcomponents have the requirements they do.

Consider a spacecraft example. The spacecraft as a whole has a requirement that it be able to point at a ground location, with some number of degrees of accuracy. To implement that feature, the spacecraft designer chooses to use the spacecraft’s attitude control system to point the spacecraft toward a ground location, and then slowly rotate the spacecraft as it passes over the ground location. The parent component—the spacecraft—has the high-level requirements for what it needs to do. The subcomponent—the attitude control system—must be able to slew accurately to an initial pointing vector, and then be able to slew slowly and accurately until the spacecraft is done with an observation. The slewing accuracy and speed are the derived requirements on the attitude control system.

The process continues recursively. The attitude control system designer decides to use reaction wheels as the primary attitude control mechanism. The requirements for slewing accuracy and speed create requirements on the reaction wheels for how quickly or slowly they can turn the spacecraft.

37.6.2 Internal derivation

Some components will have a requirement that specifies a very high-level capability the component must provide. For example, in a section on disposing of a component that is being discarded:

There are several ways this requirement could be met: destroying the retired component in house, crashing the component into the atmosphere or ground in a way that will assure the component is destroyed, or erasing the data on the component before giving the component to an outside entity for recycling.

Whatever the implementation decision is, it creates more requirements on the component, and those requirements derive from the decision on how to satisfy the requirement on protecting confidential information. If, for example, the implementation decision is to recycle a retired part, then this might lead to requirements like:

In some organizations, the practice is only to record derivation from one component to another. Sometimes that works out; in the example, the requirement for an erasure command could be on a command handling subcomponent, and the erasure requirement could be on a memory component. However, some components do not break down into subcomponents easily, for example when the component is being implemented by an outside vendor. In other cases, it is simply clearer to document the implementation requirements for the component directly and then pass the requirements through to subcomponents, so that a user can see the totality of the functional interface to the component in one place rather than having to search through subcomponents for something they don’t know exists.

37.6.3 Pass through

External objectives and standards often impose general requirements on “all components of type X”, or the like. For example, an automobile might have a requirement that all electronic components function nominally across a temperature range of -40 °C to +125 °C. (See the section on Sets as subjects below for more on this.)

This requirement can be placed on the automobile as a whole; the requirement might read

If the automobile includes engine, braking system, and entertainment systems as parts, the temperature range requirement can be passed down to those subcomponents:

But the entertainment system, which is not safety critical and operates in the more benign environment of the passenger cabin, might have the requirement:

In these examples, the general requirement is copied down into lower-level subcomponents until it reaches some component (such as the braking controller in the example) that does not have further subcomponents. Sometimes the requirement is copied verbatim, just changing the scope of the subject; other times, some component will have a variant on the general requirement.
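The copying-down step can be sketched as a walk over the component breakdown structure. This is a minimal illustration, not a description of any tool. The breakdown, the function names, and the requirement wording are hypothetical, and the narrower range given to the entertainment system is an invented placeholder value.

```python
# Hypothetical component breakdown: parent -> list of subcomponents.
BREAKDOWN = {
    "automobile": ["engine", "braking system", "entertainment system"],
    "engine": [],
    "braking system": ["braking controller"],
    "braking controller": [],
    "entertainment system": [],
}

GENERAL_RANGE = (-40, 125)  # degrees C, from the general requirement

# Components that take a variant instead of the verbatim copy.
# The numbers here are placeholders chosen only for illustration.
VARIANTS = {"entertainment system": (0, 50)}


def allocate(component, inherited, breakdown, variants, out=None):
    """Copy a range requirement down the tree, applying variants where present."""
    if out is None:
        out = {}
    rng = variants.get(component, inherited)
    out[component] = rng
    for child in breakdown[component]:
        # A variant on a component also applies to that component's subtree.
        allocate(child, rng, breakdown, variants, out)
    return out


def requirement_text(component, rng):
    """Render the allocated requirement with the subject's scope changed."""
    low, high = rng
    return (f"All electronic components in the {component} shall function "
            f"nominally across a temperature range of {low} °C to {high} °C.")


if __name__ == "__main__":
    allocated = allocate("automobile", GENERAL_RANGE, BREAKDOWN, VARIANTS)
    for name, rng in allocated.items():
        print(requirement_text(name, rng))
```

Recording the allocation as data also gives each copied requirement an obvious derivation link back to the general requirement on the parent.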

This kind of derivation is sometimes referred to as allocating requirements to subcomponents.

37.6.4 Mutual dependency

Sometimes two components are peers of each other, and need to interact. A fuel tank provides fuel to an engine; a spacecraft communicates with a ground station to send telemetry and receive commands; client and server applications send messages to each other.

These interactions involve requirements on each of the components involved, showing how the components support each other. The fuel tank must send fuel; the engine must consume fuel. The spacecraft must be able to communicate with the ground station; the ground station must be able to communicate with the spacecraft.

This leads to pairs of requirements that record this mutual dependency. At a high level,

These two requirements should show a two-way relationship with each other. (Formally, this introduces a cycle in the derivation graph.)

37.6.5 Using derivation

A derivation relationship between requirements on two different components helps to document the implementation approach for meeting a higher-level requirement. When a designer looks at the high-level requirement, they can see what features are used to implement the high-level requirement. The lower level requirements and their rationale allow the designer to see the argument that the implementation will be sufficient to meet the high-level requirement. This makes the design rationale available to people who didn’t create the design in the first place, but need to understand it to evaluate it or to make changes.

The section on analyzing requirements, below, goes into more detail on how one can look at the requirement derivation relationships to evaluate completeness or sufficiency, to argue whether low-level features are actually necessary, and to trace out the effects of making a change in requirements.

37.6.6 Viewing derivation

There are two ways that a user should be able to see derivation relationships. First, when looking at any one requirement, the user should be able to see what requirements this one is derived from directly, and what requirements derive directly from this one.

Second, good requirement management tools will also provide a graphical view of the derivation graph, as a way to see multiple levels of derivation at once. The graph is typically mostly a tree or DAG, but there are legitimate reasons that the graph will sometimes have cycles (between peer components, for example).
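Both views can be supported by a small graph structure. The sketch below is one possible representation, assuming derivation is recorded as directed edges from a source requirement to a derived requirement. The class, its method names, and the identifiers in the example are hypothetical. Because cycles are legitimate, the traversal keeps a visited set.

```python
from collections import defaultdict


class DerivationGraph:
    def __init__(self):
        self._derived = defaultdict(set)  # source -> requirements derived from it
        self._sources = defaultdict(set)  # requirement -> its direct sources

    def add(self, source: str, derived: str) -> None:
        """Record that 'derived' is derived from 'source'."""
        self._derived[source].add(derived)
        self._sources[derived].add(source)

    def derived_from(self, req: str) -> set:
        """Requirements that req is directly derived from."""
        return set(self._sources.get(req, ()))

    def derives(self, req: str) -> set:
        """Requirements directly derived from req."""
        return set(self._derived.get(req, ()))

    def all_derived(self, req: str) -> set:
        """Everything reachable from req; the visited set makes cycles safe."""
        seen = set()
        stack = [req]
        while stack:
            current = stack.pop()
            for nxt in self._derived.get(current, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        return seen


if __name__ == "__main__":
    g = DerivationGraph()
    # Hypothetical identifiers following the spacecraft pointing example.
    g.add("space:1.2", "space.adcs:2.1")          # pointing -> slew accuracy
    g.add("space:1.2", "space.adcs:2.2")          # pointing -> slew speed
    g.add("space.adcs:2.1", "space.adcs.wheels:1.1")
    g.add("space.adcs:2.2", "space.adcs.wheels:1.1")
    print(sorted(g.derives("space:1.2")))
    print(sorted(g.derived_from("space.adcs.wheels:1.1")))
    print(sorted(g.all_derived("space:1.2")))
```

When a requirement sits on a cycle, as with a pair of mutually dependent peer requirements, all_derived includes the starting requirement itself.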

Here is an example showing how a top-level requirement is the source for a number of other requirements.

37.7 Advanced requirements

All the requirements discussed so far are simple requirements. Simple requirements have a single, clearly specified subject component. Each simple requirement expresses one property about that subject that must be true.

Simple requirements are not sufficient to express every need that real systems encounter. There are two kinds of need that we have seen many times: requirements on sets of components, and requirements for standards.

37.7.1 Sets as subjects

Consider a system where all code is expected to adhere to a published coding standard. The implied requirement does not apply to any single component; it applies to all of them that include software.

This expectation can be written as a top-level requirement on the system as a whole:

The subject of this requirement is the set of all software components in the system. The property is that their implementation adheres to the named coding standard.

This kind of requirement is placed on the top-level system, and then each first-level subcomponent includes a derived requirement that propagates the requirement downward:

If component Y has subcomponents, Y should also have a second requirement that continues to pass the requirement down to Y’s subcomponents.

37.7.2 Writing for standards

Many texts on requirements approach the subject from an assumption that there is one system being built: these are the requirements for System X. System X will be built in its entirety as specified; any and all requirements must be satisfied.

Writing standards is a different problem. A standard specifies requirements on multiple hypothetical systems that may exist at some point. Those systems will not be identical, but the systems that adhere to the standard must adhere to the requirements in it.

Standards often provide options. The standard has a set of optional features. If the system chooses to implement those features, the features must conform to the standard. However, the system does not have to implement those features. This means that the system does not have to satisfy every requirement in the standard.

Some standards also present best practices. For some feature, it is recommended that the feature conform to a part of the standard, but it is not absolutely required to do so.

The vocabulary of “shall” or “must” does not accommodate these situations well. The Internet Engineering Task Force (IETF) has defined a richer set of requirement modes. For example:

The words used to indicate these more complex conditions must be defined just as carefully as “must” or “shall”, and must be used consistently.
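The effect of richer modes on a conformance check can be sketched as follows, assuming the keyword set of IETF RFC 2119 (MUST, MUST NOT, SHOULD, SHOULD NOT, MAY). The assess function and its result format are hypothetical, and a real standard may define conformance differently.

```python
from enum import Enum


class Mode(Enum):
    MUST = "MUST"              # absolute requirement
    MUST_NOT = "MUST NOT"      # absolute prohibition
    SHOULD = "SHOULD"          # recommended; may be set aside for good reason
    SHOULD_NOT = "SHOULD NOT"  # discouraged; may be done for good reason
    MAY = "MAY"                # truly optional


def assess(results):
    """Evaluate (requirement id, Mode, satisfied) triples.

    Returns (conforms, to_review). Only unmet MUST / MUST NOT statements
    break conformance. Unmet SHOULD / SHOULD NOT statements are listed for
    review. MAY statements never affect the outcome.
    """
    conforms = True
    to_review = []
    for req_id, mode, satisfied in results:
        if satisfied:
            continue
        if mode in (Mode.MUST, Mode.MUST_NOT):
            conforms = False
        elif mode in (Mode.SHOULD, Mode.SHOULD_NOT):
            to_review.append(req_id)
    return conforms, to_review


if __name__ == "__main__":
    # Hypothetical results for one system checked against a standard.
    outcome = assess([
        ("std:4.1", Mode.MUST, True),
        ("std:4.2", Mode.SHOULD, False),
        ("std:4.3", Mode.MAY, False),
    ])
    print(outcome)
```

Keeping the mode as an explicit field, rather than burying it in the sentence, makes it harder to use the words inconsistently.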

37.8 Analyzing requirements

Many people think of requirements only as a contract for guiding implementation and a checklist for performing verification tests later. However, requirements—along with other specifications—are useful in themselves for helping build a design and making sure the design is good.

There are three kinds of analysis that systems engineers do on the requirements themselves:

These are all analyses that should be done on the specifications of a system, including the requirements, and not delayed until implementation. Some of these tasks are easier to perform on the abstracted and simplified view of the system that specifications give. Performing these tasks before implementation will reduce the amount of re-implementation needed when one finds that the requirements aren’t sufficient or minimal.

37.8.1 Complete design

The expectation is that if a system is built to conform to its specification, including requirements, the system will do the job that its users need and do it correctly. (Of course, this assumes that the top-level specifications are themselves a correct and complete record of the users’ objectives; we discuss this more in the section on validating requirements below.)

To meet this expectation, the system’s requirements need to be complete and correct. This means that when one looks at any given top-level requirement, one can trace out the features on other components that will be used to implement the requirement and argue that those features will combine correctly to produce the desired result.

Having tools that allow one to view parts of the derivation graph in visual, graphical form is invaluable to performing this analysis.

Consider an example. A UAV (drone) is supposed to receive and process commands from an operator on the ground. This leads to requirements:

These requirements are not complete, because they leave out a critical step: when a command is sent from the ground operator to the UAV, the message first goes to the transceiver. The transceiver extracts the message, and then sends the message to the command and data handling component. The example omits the part about the transceiver and command handler passing information to each other. This means that one could build an aircraft that had a radio and had a flight computer, but the two would never talk to each other. Obviously, the UAV would not be acting on commands with that design.

In the example, the communication between the transceiver and command handling components should be documented in some other specification for the UAV, perhaps an activity diagram showing how commands flow through components. The requirements then need to be checked against these other parts of the specification to make sure that all of the functions in each of the steps are reflected in the functions each component is required to implement.

Sometimes determining whether a set of requirements is complete or not will require further analyses. As a simple example, the maximum mass for an aircraft might be X kg. Making sure that the aircraft’s overall mass comes in under that limit means enumerating all the components in the aircraft that have mass, adding up their mass, and determining that the result is below X kg. For that analysis to be complete, it cannot leave out, say, the mass of the motors; all components must be considered.
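The mass check can be sketched as a simple roll-up plus a completeness check against the component breakdown. This is a minimal illustration only: the component names, the masses, and the 25 kg limit are all hypothetical values chosen for the example.

```python
# Hypothetical mass budget; every name and number here is illustrative.
MAX_MASS_KG = 25.0

component_mass_kg = {
    "airframe": 9.5,
    "motors": 4.0,
    "battery": 6.2,
    "avionics": 1.8,
    "payload": 2.5,
}

def mass_margin(masses, limit_kg):
    """Return the remaining margin (limit minus total mass) in kg."""
    return limit_kg - sum(masses.values())

def missing_components(masses, expected):
    """Return expected components absent from the budget, sorted by name."""
    return sorted(set(expected) - set(masses))

margin = mass_margin(component_mass_kg, MAX_MASS_KG)
```

A negative margin means the design is over the limit. A non-empty result from `missing_components`, called with the list of components from the breakdown structure, means the analysis itself is incomplete.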

As a more complex example, a system might have a maximum acceptable failure rate target. Being able to argue that the system is reliable enough involves performing a fault tree analysis, enumerating all the ways that failures in components can lead to system failures. The analysis cannot leave out components and be complete; nor can it leave out some failure modes of some of those components.
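As a rough sketch of the arithmetic behind a fault tree, the following combines component failure probabilities through OR and AND gates. It assumes that failures are independent, which a real analysis would have to justify. The redundant flight computers and all of the probabilities are hypothetical.

```python
from math import prod

def p_or(probs):
    """Probability that at least one independent event occurs."""
    return 1.0 - prod(1.0 - p for p in probs)

def p_and(probs):
    """Probability that all independent events occur."""
    return prod(probs)

# Hypothetical per-mission failure probabilities.
p_radio = 1e-3
p_flight_computer_a = 1e-2
p_flight_computer_b = 1e-2

# Loss of command: the radio fails OR both redundant flight computers fail.
p_loss_of_command = p_or(
    [p_radio, p_and([p_flight_computer_a, p_flight_computer_b])]
)
```

Leaving a component or a failure mode out of the tree makes the computed probability too low, which is why the completeness of the enumeration matters as much as the arithmetic.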

Checking whether the design is complete is not a simple task that can be performed just by inspecting the graph of requirements. The analysis is helped by being able to see the requirements, but it requires imagination and effort to actually check the result.

37.8.2 Minimal design

Every feature and every requirement on a component should have a reason for being there.

At the top level, for the system as a whole, only features that address customer needs or business objectives should be included. At lower levels, the only requirements that should be placed on components should be ones that are actually needed to make the system work properly—meaning the system meets those top-level objectives.

37.8.2.1 Tracking the purpose for every requirement

The derivation relationships between requirements encode the reasons for a requirement to exist. This leads to a condition that should hold across all requirements:

This is straightforward to check using the derivation graph: every requirement should derive from at least one parent requirement, and it should be possible to trace upward through the derivations to reach a customer or business objective.
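The upward trace can be sketched as a small graph walk. The data layout (a mapping from each requirement id to its parent ids) and all of the ids are assumptions made for this illustration; a requirements tool would supply the real graph.

```python
# Hypothetical derivation data: requirement id -> ids of its parents.
parents = {
    "OBJ-1": [],            # customer objective (a root)
    "SYS-1": ["OBJ-1"],
    "CMP-1": ["SYS-1"],
    "CMP-2": [],            # no parent recorded
    "CMP-3": ["CMP-2"],     # has a parent, but never reaches an objective
}
objectives = {"OBJ-1"}

def reaches_objective(req, parents, objectives, seen=None):
    """True if some upward derivation path from req ends at an objective."""
    if req in objectives:
        return True
    seen = set() if seen is None else seen
    if req in seen:
        return False  # guard against cycles in malformed data
    seen.add(req)
    return any(reaches_objective(p, parents, objectives, seen)
               for p in parents.get(req, []))

def untraceable(parents, objectives):
    """Requirements that do not trace up to any objective."""
    return sorted(r for r in parents
                  if not reaches_objective(r, parents, objectives))
```

Note that the check flags `CMP-3` as well as `CMP-2`: having a parent is not enough if the chain of parents never arrives at an objective.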

Often while requirements are being developed, a requirement will be placed on some component without setting up the derivation. This requirement will not have a parent, and so the checking method will flag it. But what to do then?

In most cases, there was a good reason that someone wrote that component requirement. When one finds a requirement that is not documented as supporting some higher-level reason, it is worth exploring why that requirement is valuable. In some cases, the parent requirement(s) are present, and the requirement just needs to be linked to them. In other cases, the requirement can be a clue that there is some higher-level principle that the writer had in mind, and that higher-level principle should be added into the requirements higher up in the system.

For example, consider a data storage component where an engineer placed a requirement that all data be stored in an encrypted form. As written, that requirement doesn’t derive from any other requirement. But why did the engineer believe that encryption was necessary?

One answer is that encryption isn’t necessary. In that case the encryption requirement can be removed. Another answer is that the engineer wrote that requirement because they believed that the component would be storing confidential data that should be protected against disclosure. In that case, it is worth checking: does the system have requirements—or business objectives—about protecting confidential data? If not, then this exercise will have found a topic that has not been adequately addressed, and new requirements need to be added to make a correct specification. Those requirements should be added throughout the system, and the requirement we started with should show that it derives from those new features.

Many such requirements result from external standards that are supposed to be met, such as regulatory, safety, or security standards. Those standards should be included in the external objectives for the system, and their requirements should flow down through the system to the components where the standards apply. This produces a record of how the system’s design complies with those standards.

37.8.2.2 Finding unnecessary requirements

Some requirements that show how they are derived from some parent requirement are still not actually necessary.

There is no simple, mechanical way to find these unnecessary requirements. However, the analysis used to determine whether a collection of requirements is complete is also useful for finding these unneeded requirements.

The requirement about encryption is not actually needed for the system in question. That is because the connection between the transceiver and command handling components is physically contained within the UAV, and the physical encapsulation provides enough security to protect the messages passing between the two. The encryption requirement can be removed with no loss of capability.

However, in this example, the engineer who wrote the encryption requirement had a good idea but expressed it wrongly. The engineer understood that the integrity of communication between the two components was important; a command that was properly received but garbled in being sent to the command handling component could be a problem. The encryption requirement should be replaced by a less costly requirement: that the channel must protect the messages it carries against corruption.

37.8.3 Consistency

Consistency in a body of requirements is when the requirements don’t contradict each other. If requirements do contradict each other, the system as specified isn’t implementable and the specification needs to be fixed.

As long as requirements are written as text, and not in a formal notation, consistency checking will be manual. It involves reading through each requirement, finding other requirements that address related topics, and checking that they are consistent with each other.

Some inconsistencies are fairly easy to detect. If one requirement says component X shall be blue and another says component X shall be red, it’s obvious—one must just read through all the requirements on component X and see that two requirements both deal with the color property and they say opposing things.

Other inconsistencies are harder to spot because they do not use the same language in the properties they are specifying. As an example, one requirement might say component X shall use encryption algorithm Y while another requirement says component X shall use protocol standard Z. If protocol standard Z allows encryption algorithm Y, this is fine. But if the standard does not allow that particular encryption algorithm (perhaps because the algorithm is outdated and no longer considered secure enough) then there is an inconsistency.

Another class of inconsistency comes from the states a component can take on. Elsewhere in the specification of a component, there should be a definition of the state machine that the component is supposed to follow. The requirements translate that state machine into individual actions that the component is expected to take in response to particular inputs. It is easy—especially when editing or updating the component’s specification—to have two requirements: when condition A occurs, component X must transition to state Y and when condition A occurs, component X must transition to state Z. The inconsistency can be more subtle, such as leaving out some transition, or using inconsistent definitions of the condition that causes the transition. This class of problem can be addressed by having a single, clear definition of the state machine the component is expected to follow, and then checking the requirements against the state machine.
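Once transition requirements are reduced to tuples, the conflicting-transition case can be found mechanically. The sketch below assumes each requirement has been hand-translated into a (requirement id, current state, condition, next state) tuple; the ids and the `idle` state are hypothetical.

```python
from collections import defaultdict

# Hypothetical requirements reduced to
# (requirement id, current state, condition, next state).
transitions = [
    ("REQ-10", "idle", "A", "Y"),
    ("REQ-11", "idle", "B", "Z"),
    ("REQ-12", "idle", "A", "Z"),  # conflicts with REQ-10
]

def conflicting_transitions(transitions):
    """Map each (state, condition) with more than one target to its requirement ids."""
    targets = defaultdict(set)
    sources = defaultdict(list)
    for req_id, state, condition, next_state in transitions:
        targets[(state, condition)].add(next_state)
        sources[(state, condition)].append(req_id)
    return {key: sources[key] for key, nexts in targets.items() if len(nexts) > 1}
```

This only catches direct contradictions. Missing transitions and inconsistent definitions of a condition still need to be checked against the state machine definition itself.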

Finally, another class of inconsistency that can be hard to detect has to do with timing. Two requirements can impose timing constraints that cannot both be satisfied. For example:

There is no way for component X to meet the timing requirements given the order that events must occur. Building a timing model of the component in question, and performing a timing feasibility analysis using that model, can help find this kind of inconsistency.

This is by no means an exhaustive list of the kinds of inconsistency one must look for.

37.8.4 Effects of changes

Systems change. This can happen because customer needs change, or because technology changes, or because someone has found a better design for part of the system. A good development process supports constant evolution and change of the design and implementation of a system.

Not every change that is proposed will be performed. When someone proposes a change, someone else will analyze the proposal to determine the effects of the change. Based on this analysis, people may decide to go ahead, postpone the change, or not make the change.

This analysis makes use of all the specifications in the system, but requirements are a major contributor. In particular, the derivation relationships help show how component features depend on each other, and thus help guide an analysis of how far some change will spread.

37.8.4.1 Effects of top-level changes

Top-level changes include adding a new feature to the system, removing a desired feature, or changing a standard or other external source of requirements.

If the change changes a top-level requirement, look at the derived requirements from that changed requirement and see if the derived requirements are still necessary and sufficient to satisfy the newly-changed requirement. If they are, then no further action is needed. If they are not, then the derived requirements must be revised, possibly adding or removing some of them. The process then needs to repeat with these changed derived requirements. If the change affects a requirement that supports a different top-level requirement, then one must check that the other top-level requirement is still satisfied by the changed derived requirements.

If the change adds a new top-level requirement, work out what derived requirements are necessary and sufficient to satisfy the new requirement. Look for lower-level requirements that already exist that can also support the new requirement. This may involve a change in design, not just requirements; this will cause more changes to propagate out.

If the change removes a top-level requirement, see if any lower-level derived requirements are no longer needed or can be relaxed. If so, work downwards to propagate the effects of those changes.
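The downward walk in these cases can be sketched as a breadth-first traversal of the derivation graph. The graph layout and requirement ids are hypothetical; the traversal only produces the list of requirements to review, and the judgment about whether each one is still necessary and sufficient remains manual.

```python
from collections import deque

# Hypothetical derivation data: requirement id -> ids derived from it.
children = {
    "TOP-1": ["SYS-1", "SYS-2"],
    "TOP-2": ["SYS-2"],
    "SYS-1": ["CMP-1"],
    "SYS-2": ["CMP-2", "CMP-3"],
}

def requirements_to_review(changed, children):
    """Breadth-first list of requirements derived, directly or not, from `changed`."""
    order, seen, queue = [], {changed}, deque([changed])
    while queue:
        current = queue.popleft()
        for child in children.get(current, []):
            if child not in seen:
                seen.add(child)
                order.append(child)
                queue.append(child)
    return order
```

In this sample data `SYS-2` also supports `TOP-2`, so a revision to `SYS-2` triggered by a change to `TOP-1` would require checking that `TOP-2` is still satisfied.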

37.8.4.2 Effects of lower-level changes

Many more changes will come to lower-level components in the system. There are many reasons this can happen: because people have found that a design in process is infeasible or too costly; because a vendor’s part specification or availability has changed; or because someone has found a better design for some lower-level component.

Evaluating a lower-level change involves all the checks for a top-level change above, along with the need to see how the change will affect higher-level requirements. Will the change leave the higher-level requirement unsatisfied? Will this change make some other sibling requirement redundant (that is, the parent is satisfied without the sibling)?
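For a lower-level change, the same derivation data can be walked upward to find the higher-level requirements to recheck, and sideways to find siblings that may have become redundant. The sketch below uses a hypothetical mapping from each requirement id to its parent ids.

```python
# Hypothetical derivation data: requirement id -> ids of its parents.
parents = {
    "SYS-1": ["TOP-1"],
    "CMP-1": ["SYS-1"],
    "CMP-2": ["SYS-1"],
    "CMP-3": ["SYS-1", "TOP-2"],
}

def ancestors(req, parents):
    """All requirements that req supports, directly or indirectly."""
    found, stack = set(), list(parents.get(req, []))
    while stack:
        current = stack.pop()
        if current not in found:
            found.add(current)
            stack.extend(parents.get(current, []))
    return found

def siblings(req, parents):
    """Other requirements that share at least one parent with req."""
    mine = set(parents.get(req, []))
    return sorted(r for r, ps in parents.items()
                  if r != req and mine & set(ps))
```

Both functions are only as good as the recorded derivations: a missing link hides an affected requirement from the analysis.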

Tracking down these effects is much easier if the derivation relationships among requirements are accurate.

37.8.4.3 Tools

Good tools help the process of evaluating changes. There are three features in particular to look for:

37.9 Validating requirements

Validation is the process of determining whether a set of requirements accurately reflects the needs of the system. This can mean that the system will meet customer needs, or mission needs, or other external objectives.

It is important to keep validation separate from verification, which is discussed below. Validation is about seeing if the requirements (and the rest of the specification) are an accurate reflection of external needs. Verification is about seeing if the implementation is an accurate reflection of requirements. (Some software engineering texts focus validation on consistency, completeness, and similar properties. Systems engineering has generally kept those kinds of checks separate from validating customer or mission satisfaction.)

The validation process starts with checking the system objectives, business objectives, security and safety objectives, and regulatory objectives to see if they are an accurate reflection of the customer or mission needs. Presumably appropriate care has been taken while these objectives are being gathered and written down, but mission understandings or desires change over time and an independent check on the objectives will help avoid having problems be discovered late, when it is expensive to make changes.

At lower levels, one is checking whether the derived requirements from a parent are necessary and sufficient. The analyses for complete and minimal design, discussed above, cover those checks.

There are many different ways to validate a system’s specifications. They generally fall into two groups: analysis and simulation.

Validation by analysis involves people reviewing the requirements and using their judgment to check the specifications. This can involve performing joint reviews with stakeholders so that they can check the requirements.

Validation by simulation involves stakeholders somehow seeing a model of the system in action. There are many ways to do this. Stakeholders can be invited to define some scenarios that represent how they will use the system, and then try out those scenarios using a model of the system. Some ways we have done this include:

These validation exercises should be completed and the stakeholders should concur that the specifications are correct before one baselines the specifications, including requirements.

37.9.1 Connecting requirements and implementation artifacts

People must be able to navigate from a requirement to its associated implementation artifacts and vice versa. The people implementing a part of a system according to requirements need to be able to quickly and accurately find the requirements that they need to comply with. In the other direction, the people verifying requirements must be able to find the artifact or artifacts that implement a particular requirement.

The approach to organizing systems artifacts that I advocate here, which organizes much of the systems work around a hierarchical component breakdown structure, is designed to meet this need conveniently. The set of requirements that apply to some component is implicitly connected to other specifications and the implementation of that component because they are all organized by the same component names and identifiers.

One can also explicitly label artifacts with component identifiers or requirement ids. For example, verification test specifications are associated with specific requirements, so the test specification needs to be labeled with the requirement ids that it applies to.
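Explicit labels make navigation in both directions a matter of building an index. In the sketch below, the artifact paths, the requirement ids, and the idea of storing labels as a simple mapping are all hypothetical.

```python
from collections import defaultdict

# Hypothetical artifacts, each labeled with the requirement ids it covers.
artifact_labels = {
    "tests/transceiver_rx.md": ["CMP-1"],
    "tests/command_path.md": ["CMP-1", "CMP-2"],
    "src/command_handler.c": ["CMP-2"],
}

def index_by_requirement(artifact_labels):
    """Invert artifact -> requirement labels into requirement -> artifacts."""
    index = defaultdict(list)
    for artifact, req_ids in sorted(artifact_labels.items()):
        for req_id in req_ids:
            index[req_id].append(artifact)
    return dict(index)

def unlabeled_requirements(all_req_ids, artifact_labels):
    """Requirements that no artifact claims to implement or verify."""
    covered = {r for ids in artifact_labels.values() for r in ids}
    return sorted(set(all_req_ids) - covered)
```

The second function gives a quick gap report: any requirement it returns has no labeled implementation or verification artifact yet.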

37.10 Verification

Verification is the process of showing that the implementation of the system, or parts of it, complies with the requirements.

Verification involves gathering evidence that every requirement is satisfied by the implementation.

There are four general methods used to verify the implementation’s compliance:

Inspection is verification by having people review parts of the implementation to check that it complies with a requirement. The inspection review should be performed by people who did not implement that part of the system, so that the reviewers are not misguided by preconceptions (“I’m sure I implemented this correctly”).

Some inspections are particularly simple. Consider a high-level requirement that is the source for a few lower-level requirements. In many cases, the high-level requirement is satisfied when the lower-level derived requirements are all satisfied. In these cases inspection becomes a simple matter of checking that the derived requirements are all satisfied. The rationale associated with the derivation or with the high-level requirement should indicate when this situation applies.

Test and demonstration are similar. Testing is generally more exhaustive, and necessary for lower-level components. A single electronic component, for example, might be operated across all the specified thermal, vibration, and atmospheric environments it must handle. Demonstration is less exhaustive, and used to verify top-level system objectives. A prototype spacecraft radio transceiver might demonstrate that it can communicate with ground stations from a similar orbit to where the final spacecraft system will operate.

Some requirements cannot effectively be verified by test or demonstration, and must be verified using analysis. This occurs when one is verifying a negative condition: the verification must show that the system will not perform some action or be in some condition at any time. Providing evidence of the absence of some condition is a long-standing scientific and engineering problem because proving the presence of some condition is relatively easy—demonstrate it happens in one case and that’s sufficient—but showing absence often requires exhaustive search. These verification problems often arise in safety and security requirements, where unsafe failures must be rare (e.g. no more than once in 10⁹ operating hours) or a system must resist a class of attacks (showing that no attack of that class will succeed).

Each requirement should have an associated verification specification. The specification should lay out what steps must be taken to determine whether the implementation is correct or not. A verification specification is often complex—many pages of documentation for a three-line requirement.

Verification status is a measure of how well the implementation matches the specification, including requirements. In practice this means how well a version of the implementation complies with a version of the specification, as both implementation and specification evolve over time. This means that, during design or implementation, there is no one single “verification status” that can be tracked: with each new update to the implementation, the verification status changes. Some practitioners and tools make the mistake of tracking verification status only in terms of requirements: which requirements have been satisfied by the implementation? This leads to project management errors when a change is made to the implementation that improves the implementation in one area but causes other parts of the system to go out of compliance—a common occurrence while in the middle of implementation using iterative approaches.
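One way to avoid the single-status mistake is to record each verification result against both the specification version and the implementation version it was run on. The record layout, ids, and version labels below are hypothetical; this is a sketch of the idea, not a description of any particular tool.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class VerificationResult:
    requirement_id: str
    spec_version: str
    impl_version: str
    passed: bool

# Hypothetical history of verification runs.
results = [
    VerificationResult("CMP-1", "spec-v3", "impl-41", True),
    VerificationResult("CMP-2", "spec-v3", "impl-41", True),
    VerificationResult("CMP-1", "spec-v3", "impl-42", True),
    VerificationResult("CMP-2", "spec-v3", "impl-42", False),  # regression
]

def status(results, spec_version, impl_version):
    """Pass/fail per requirement for one specific pair of versions."""
    return {r.requirement_id: r.passed for r in results
            if r.spec_version == spec_version
            and r.impl_version == impl_version}
```

Asking for the status of `impl-42` shows the regression on `CMP-2`, while the earlier result for `impl-41` remains on record rather than being overwritten.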

37.11 Limitations of requirements

Requirements have limitations. Writing a good specification for a system means understanding these limitations and addressing them in one way or another.

One limitation is that requirements are written in natural language. Human language is notoriously difficult for pinning down precise meanings, even within a single group of people. Specifications, including requirements, are used to communicate between different groups of people with different outlooks, experiences, and jargon. This makes it hard to write requirements that will be interpreted the same way by all of the people involved.

The limitation of natural language can be partly mitigated using a couple of techniques. One is to maintain a glossary that defines words or phrases that have specific meanings in the specification beyond common understanding. The second is through social cohesion: having enough people from different groups interacting and discussing the system so that they evolve a common understanding of the meanings of things.

Precision is another limitation. Some specifications can be clear and simple in mathematical notation, while they are hard to follow in prose. (Consider expressing Newton’s law of gravitation as an equation versus in prose.)
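To make the comparison concrete, Newton's law of gravitation in prose says that two point masses attract each other with a force proportional to the product of their masses and inversely proportional to the square of the distance between them. As an equation, with $F$ the magnitude of the force, $G$ the gravitational constant, $m_1$ and $m_2$ the masses, and $r$ the distance between them, it is:

```latex
F = G \frac{m_1 m_2}{r^2}
```

The equation is shorter and leaves no doubt about which quantities are multiplied and which is squared.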

A third limitation comes from requirements being single statements. Sometimes the specification needs to encode a complex, multistep activity. Each of the steps might be encoded as an individual requirement, but it is awkward and hard to understand. Sometimes the better answer is to write part of the specification in a different form—a flowchart, a state machine, or a set of equations.

As a result, requirements are only one part of the total specification. They cannot do the entire job of recording the full specification of the artifact in question—but they are often the most flexible way to organize most of the specifications. Be prepared to supplement textual requirements with other kinds of specification to get the whole job done.

37.12 Working with requirements

This chapter has mostly covered what requirements are. This section touches on what one does with them and how they evolve over time.

Requirements will change continuously over the life of a project. The rate of change will be high at the project’s beginning, when the team is trying to sort out what the system should be. The rate of change will increase after the high-level system purpose is sorted out and as the design work proceeds in parallel on different components in the system. The rate will taper off as the design and implementation become more mature, with occasional bumps as people find problems with the specifications, or as stakeholders request changes. Ideally the rate will reach zero when the system is ready to go operational, but even while in use people will find changes they would like to make.

Detailed requirements are expensive to develop and maintain. They encapsulate the complexity of how all the parts of a system are interconnected. They require effort to develop in the first place, involving checking for consistency and feasibility across large parts of the system. Changes later involve even more effort, especially if the changes involve reorganizing specifications that have already been developed.

This leads to a tension: changes will always happen, especially with modern, flexible systems, but the cost incentivizes developing all the requirements at once and then freezing them to minimize the cost of change.

This tension is unavoidable, but there are things one can do to reduce the difficulty.

37.12.1 Supporting the life cycle

The requirements for a system, and indeed all the specifications for the system, grow and evolve over time. The times and ways in which requirements change depend on the development process a project is using. However, all these processes share some tasks in common.

Review and approval. People will propose updates to the system’s design as a project moves forward. This occurs often at the beginning of a project, as the design goes from vague ideas to concrete specifications; it continues during the life of the project as stakeholders ask for changes, as engineers find problems or improvements with the current design; and it can continue after a system is released to operation, as people find problems in actual use. These changes will result in specific proposed updates to the requirements. The proposed updates need to be checked before they are accepted and applied to the baseline. Once applied to the baseline, everyone developing the system implementation will need to work to revise their part of the implementation to match, and verification steps will be required, and so on—thus it is important to control changes to the baseline to be sure that they are sound and within the project’s scope before committing to them.

Projects generally use a review and approval process to decide whether to apply an update to the baseline or not. In the review part, systems engineers check the updates to ensure they meet guidelines, including consistency, completeness, and minimality. People who will be affected by the update are asked to review the update, to evaluate whether it is technically correct from their point of view and whether the change is feasible. Project managers are asked to evaluate the update to determine whether the change is in scope and whether there are resources to accommodate the change. If all those parties agree, then the update is approved and someone creates a new requirements baseline that incorporates the changes.

37.12.2 Who works with requirements

Many people generate or use requirements during the lifetime of a project. These include:

37.13 Tools

The right tools make working with requirements much easier and more accurate. However, different requirements management tools are designed to support different styles of requirement writing and use, so you need to choose tools that match how you will write, organize, and use requirements.

Here are some questions that can help you evaluate requirements management tools.

People will use the requirements management tools to perform a number of tasks. You should evaluate how well requirements tools support these activities.

Chapter 38: Models

Chapter 39: Interface definitions

Part IX: Design

Chapter 40: Design introduction

40.1 Purpose

Previous chapters introduced how to work out what a system or a component should do, by determining what the objectives are for it and then turning those objectives into a specification.

A design for a component provides a simplified model of how the component will achieve the behaviors, qualities, and structure laid out in its specification. The design is not the full details of how it will achieve those things, or a detailed implementation. The design is a plan for how the component will be built, at a high level; it records the high-level decisions about how the component will be implemented without actually being the implementation.

“Design” is an activity that lacks sharp boundaries from other development activities. On the one hand, it responds to the objectives and specifications that have been developed for the thing being built; on the other hand, the act of designing usually reveals gaps in the specifications that lead to feedback that causes people to update the specification. Specification and design proceed recursively as a system is built, where the act of designing one component leads to writing specifications for its subcomponents.

“Design” also lacks a distinct boundary with “implementation”. Indeed, the boundary between the two varies by convention in different disciplines.

40.1.1 Defining “design”

Given the diversity of ways the word “design” is used, I will define what I mean by the term in general.

In some projects I have used the term “design model” for the design, to emphasize that the design is a simplification and explanation of the most important aspects of the component’s implementation.

40.1.2 Contents of design

All of this information should be annotated with a rationale for the decisions that led to the particular design.

40.1.3 Purposes of a design

Why should one take the deliberate and separate step of putting together a design for a system or component, rather than just implementing a component directly based on its specification?

For an exceptionally simple component, one can skip design and just implement the component—but the component must be truly simple, completely understandable from its implementation, involving no significant design choices, and with no future need to change the component, for this to pay off in the long run.

The value of an explicit design comes partly from its abstraction and simplification, and partly from being done mostly before putting together the detailed implementation.

Time to reflect. This is perhaps the most important reason to take the time to build a design before implementing a component. Modern systems are deeply interconnected. The design choices for one component have effects not limited to that component, and the design choices must usually reflect the needs that many other components place on the one being designed. It takes time to find and understand all these interdependencies.

Many components can be designed in multiple different ways. It is often useful to spend some time developing multiple design approaches before settling on one of them. In many cases it is useful to have two or three design approaches, one of which imposes requirements on some subcomponent that are difficult to achieve. That difficulty may not reveal itself until people have proceeded into the specification and design of that subcomponent. Only then may one realize that an alternative design for the original component is better.

Finally, the design needs to support all of the component’s or system’s specification. Rushing through the design increases the likelihood that some essential requirement will get missed, leading to problems later when the component is integrated with others, or when the system goes into operation, and a subtle failure occurs.

Balanced and incremental design. Modern, complex systems involve many different kinds of constraints on components. A component may need to meet all of structural, safety, functional, security, reliability, environmental, maintainability, user interface, and budget constraints to meet its specification and thus to function correctly in the system as a whole.

I have found that focusing too much on any one of these aspects leads to an unbalanced design that does not meet some other aspect. This can lead to repeated partial design followed by redesign after redesign, each time focusing on a different aspect.

The alternative is to consider a little of each aspect at the same time, working to find a rough design that looks like it will be going in a feasible direction for all of these aspects. After there is a rough design, one can go into greater depth on individual aspects with lower risk that the dive into one area will result in not meeting constraints on another aspect.

As one example, reliability and safety often work against each other. The safer choice is often to shut down a component rather than trying to keep it in operation after a failure. Conversely, the redundancy needed to increase reliability increases the complexity of the component, leading to more conditions that could lead to a safety violation.

Guide and explanation. Multiple people will use a design over the course of a project. While one person may develop the first design, others will analyze it for safety or security; still others will review the design for completeness or correctness; one or more people will use it to implement the component; other people will use it to develop and perform verifications. Later, other people will use the design to understand a component that may need a bug fix or feature change.

In other words, the design is for communicating among many different people and over potentially long periods of time, when the people who originally made the design are no longer available to answer questions from their memory.

For all those people who work on the component later, the design provides a guide to understand how the component is organized.

All too often, an engineer is asked to figure out why some existing software component is not working as expected. There is no design, just the source code. The engineer has to try to extract the design from the source code in order to figure out where the component is not behaving as it should. Extracting the design takes time and effort that could be avoided if the design could just be consulted. An extracted design is rarely accurate: the source code does not have a record of where there are subtle, unobvious aspects of the design; nor does it record why the design is what it is. The result is greater cost and time required to update the component, and a higher risk of a change introducing more problems than it fixes.

Decision rationales. A good design includes an explanation of why particular decisions were made. This information helps those who review and analyze the design to determine whether good choices were made. More important, the rationale informs the people who later need to update or redesign the component.

It is common for an electronic board component that stays in production for more than a handful of years to run into a situation where some chip is no longer available. The manufacturer has stopped making the original chip X, but another manufacturer is making a chip Y that is supposed to be pin-compatible with chip X. Is it okay to substitute chip Y for chip X? That depends on what it was about chip X that led to its selection. If the choice was based only on the basic chip function, the substitution is probably okay. However, if the choice was based on something unobvious, like chip X’s radiation tolerance resulting from a particular lithography technique, chip Y may not be an acceptable replacement. The only way to know that the radiation tolerance was a key part of the decision is if someone wrote down that rationale.

Supporting analysis. Many key component properties, especially those related to safety, security, or reliability, are emergent from the design. It is increasingly evident that these properties are difficult to retrofit into a completed design: they involve the fundamental organization of elements of the design.

This leads to approaches of security-guided or safety-guided design. In these approaches, the security or safety properties are considered from the start and included in the design. As the design progresses from a rough sketch to something more detailed, it can be analyzed with progressively greater accuracy to determine whether these properties are being met.

This approach is relatively inexpensive and easy when it is being done as part of the original design effort. A safety analysis can determine what high-level aspects of a control loop are essential for safe operation; a security analysis can determine what information flow properties must be met to maintain security. These analyses help early pruning of potential design approaches that would not meet safety or security needs.

The alternative is to proceed without including safety or security considerations, then go back and work out control or data flow on a more complex design, and then repeat parts of the design process while undoing earlier decisions. Repeating work like this takes more time and effort, and is more likely to result in an implementation that has safety or security flaws.

40.2 How designs are used

As noted above, a design enables communication among multiple people, across different times, and for different purposes.

Developing the initial design. One or more people take the objectives, CONOPS, and specification for a component and eventually produce one or more potential designs for that component.

Developing the design is not a single, monolithic activity. It almost always proceeds incrementally, evolving the design from a rough sketch through multiple ideas that turn out not to be quite right until reaching a design that looks like it will meet the component’s specification. The designers will need to try out multiple ideas along the way, meaning that what they document will need to evolve as they try different approaches.

The process of assembling a design can be characterized as working through each of the elements of the specification, while at the same time matching the specification against the possible building blocks for the component. As a simple example, this might involve matching a specification for an electrical energy storage system to store X mAh of energy against a catalog of available battery products.

Actual component specifications involve multiple aspects, some of which will work against each other. A realistic electrical energy storage system must meet performance specifications such as the amount of storage, maximum safe current, reliability constraints, and a number of constraints related to safety. This leads to the recommendation that a designer consider many specification aspects at once, but only at a high level, before going into greater detail.
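As a rough illustration of matching a specification against a catalog, the following minimal sketch filters a small battery catalog against several specification aspects at once. The catalog entries, field names, and thresholds are all hypothetical, chosen only to show how a candidate can satisfy one aspect while failing another.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Battery:
    """One hypothetical catalog entry."""
    name: str
    capacity_mah: int         # amount of storage
    max_current_a: float      # maximum safe continuous current
    has_thermal_cutoff: bool  # stand-in for a safety-related constraint


@dataclass(frozen=True)
class StorageSpec:
    """A simplified specification covering several aspects at a high level."""
    min_capacity_mah: int
    min_current_a: float
    require_thermal_cutoff: bool


def candidates(spec, catalog):
    """Return the catalog entries that satisfy every aspect of the spec."""
    return [
        b for b in catalog
        if b.capacity_mah >= spec.min_capacity_mah
        and b.max_current_a >= spec.min_current_a
        and (b.has_thermal_cutoff or not spec.require_thermal_cutoff)
    ]


# Illustrative data only; these are not real products.
CATALOG = [
    Battery("cell-A", 2000, 2.0, True),
    Battery("cell-B", 5000, 1.0, True),
    Battery("cell-C", 5200, 3.0, False),
    Battery("cell-D", 6000, 4.0, True),
]

capacity_only = StorageSpec(5000, 0.0, False)
full_spec = StorageSpec(5000, 2.0, True)

capacity_matches = [b.name for b in candidates(capacity_only, CATALOG)]
full_matches = [b.name for b in candidates(full_spec, CATALOG)]
print(capacity_matches)
print(full_matches)
```

Looking at capacity alone leaves three candidates in this toy catalog; adding the current and safety aspects leaves one. A designer who committed to a candidate based on capacity alone would have had a two-in-three chance of picking a part that later fails another constraint.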

In the end, the designers must either show that the design they have created fulfills the corresponding specification, or show that the specification is flawed in some way and feed that information back to the people responsible for the specification to get it changed.

Evolving a design. Every design will evolve, both during the initial system development and over time as the system is used or upgraded or fixed. Any change to the design needs to be evaluated for its scope, its effects, and its correctness.

Evaluating scope and effects means determining what effects the change will have in addition to the specific change being considered. A change in one part of a component might affect some safety property of the component as a whole, for example. A change might also affect some behavior or structure that some other component depends upon, possibly indirectly across multiple intervening components. Substituting one chip for another in a board design might change the timing of some signal, which leads to a subtle change in the sequence of operations performed by software on another board, which in turn invalidates a monitor watching for faults.
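The indirect chain in the chip example can be sketched as a walk over recorded dependencies. This is a minimal sketch that assumes the design records, for each item, which other items depend on it; the item names and the `DEPENDENTS` table are hypothetical.

```python
from collections import deque

# Hypothetical record of "is depended on by" relationships in a design.
DEPENDENTS = {
    "power_supply": ["chip"],
    "chip": ["signal_timing"],
    "signal_timing": ["board_b_software"],
    "board_b_software": ["fault_monitor"],
    "fault_monitor": [],
}


def affected_by(changed, dependents):
    """Return everything that directly or indirectly depends on `changed`,
    in breadth-first order (nearest effects first)."""
    seen = {changed}
    order = []
    queue = deque([changed])
    while queue:
        item = queue.popleft()
        for dep in dependents.get(item, []):
            if dep not in seen:
                seen.add(dep)
                order.append(dep)
                queue.append(dep)
    return order


impact = affected_by("chip", DEPENDENTS)
print(impact)
```

A traversal like this only identifies what needs to be re-examined. Deciding whether each affected item is actually invalidated by the change still takes engineering judgment and analysis.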

Evaluating correctness involves checking that any analyses done on the previous design to show that safety, security, or other properties hold either continue to hold or that the analyses can be adjusted to show that the updated design still meets those criteria.

Navigating through the system. Many people will need to find things in the system over time: developers, reviewers, auditors, and many others. Virtually none of them will come in with a complete understanding of the system and its structure, so they will need a guide that helps them learn the structure of the system and find where some behavior or feature is implemented.

The system design can support such users in three ways. First, the design can provide the breakdown structure, showing how the system is divided into components, those into subcomponents, and so on. The breakdown structure also groups related components together, so that a user can narrow down where they are looking. Second, the design can show how components are related to each other. If one component in one part of the system is providing feedback signals to a component in a different part of the system, making these relationships explicit provides a way for a user to trace out these interactions. And third, including explanations or rationales for why the design is the way it is helps educate the user about subtleties that are not going to be apparent from just reading about the structure, interactions, or behaviors.

40.2.1 Design leading into implementation

As well as all the uses listed above, the developer uses the design as a guide for the implementation. The resulting implementation must be consistent with the design: having the same structure and behavior, including all the functions in the design, and including no functions not in the design.

The developer or implementer must be able to understand the design to build a component that matches the design. The developer must also be able to check that they understand the design properly, so that there is a way to catch misunderstandings. A good design uses consistent structure, terminology, and diagrams to aid understanding. It provides a glossary that defines how terms with multiple possible meanings are used in the design.

Developers will find problems with the design as they proceed through implementation. They may find ambiguities, where the design is unclear or where the design does not address some important condition. The developer may find errors, where the design is inconsistent internally or with its specification. The developer may find that parts of the design aren’t feasible to implement. All of these problems need to be fed back to designers for clarification or correction.

When the design changes, the developer needs to be able to identify what parts of the design have changed so they can change the corresponding implementation. The change might come in response to feedback from the developer, or evolution of the design to address changing needs or broader system fixes. This can be supported by using tools that track design versions and highlight design changes between versions.

Finally, the developer must be incentivized to follow the design (or provide correction feedback) as they implement the component. This includes having the designer and independent people review the implementation to compare it to the design. If the reviewers find that the design and implementation are not consistent, they must decide how to change the design, the implementation, or both in order to achieve consistency. The component implementation should not be accepted as complete until the design and implementation match.

40.3 Design artifacts

The artifacts that record the design enable all the usage cases listed above. The key functions they need to fill include:

40.3.1 Supporting infrastructure

The designs for a system need to be available to everyone associated with the project, so that they can use the design to learn about the system and navigate through it.

An ideal solution provides a “single source of truth”: a user can go to one place and see all of the information about the system. The ideal solution also ensures that the user always sees a single consistent version of all the information. To the best of our knowledge, at present there are no systems that completely meet this ideal. However, there are ways to come close by integrating multiple tools and applying conventions to how they are used.

40.3.2 The artifacts

The following sections list the key artifacts that should be part of a design. Later chapters will detail these artifacts.

40.3.2.1 Breakdown structure

The breakdown structure consists of the hierarchical relationship of system, components, and their subcomponents recursively. It gives a name or identifier to each component, and provides the index or table of contents to the parts that make up the system. See Chapter 41.

40.3.2.2 Control structure and other large-scale behaviors

A complex system will have behaviors or structures that cross multiple parts of the system, and don’t neatly fit within a single hierarchy of components. There are two important examples of these behaviors to document.

The first example is behavior or activity sequences that show how different parts interact with each other. These are sometimes documented as UML or SysML activity diagrams, which show how control or data pass among components, and how different components take actions in response to those. The point of these patterns is to show how components work together, which informs the interfaces, actions, and states that the components involved in the activity must support.

The second example is the hierarchies of control that operate in the system. These document how one part of the system controls the functions in other parts, including how some components provide sense data to drive the control logic, and how the control logic in turn sends commands to other components to effect control actions. Documenting and analyzing these control systems is an essential part of some safety and security process methodologies, such as STPA [Leveson11].

40.3.2.3 Details of each component

Each component in the system should have its own design. This is the primary content about individual components, as opposed to how components work together.

A component’s design can be represented in many different ways. However, it is easiest for users if all designs follow the same general format so that they know how to find particular kinds of information within every design.

Each component’s design should include rationale: the reasons why different design choices were made. This information helps those who must come along later to review or update the design.

In some cases, not all of this information can be represented in one way or in one tool. For example, for electronics designs the best way to represent some information will be in a CAD drawing that is maintained in a separate tool from the rest of the design information. In these cases, there should be unambiguous references from the main design to the CAD drawing and vice versa, and the versioning in the main design should be reflected in versioning in the CAD tool.

40.3.2.4 Safety, security, and other analyses

Part of the reason for developing a design—as a simplified model of what will be implemented—is to enable analysis of the design’s essentials. These analyses address whether the design will meet aspects of the component’s specification. These can include safety and security, as well as meeting business objectives, regulatory requirements, performance specifications, or resource budgets.

As I will discuss in the next section, it is recommended practice to develop these analyses incrementally in parallel with the design itself. In this way, a rough analysis of a rough design can provide quick, early feedback that will guide the design toward meeting its specified properties as it is developed in more detail.

These analyses become an important part of the record of a design once complete. They provide an extended rationale for why the design is the way it is. They may be needed to answer to external stakeholders, including regulators or courts of law, when it becomes necessary to provide evidence why the design is acceptable. The analyses also help people who must later evolve the designs to understand both the constraints on what they can change, and where they have freedom to make changes without invalidating the safety or other properties of the design.

40.4 Developing designs

As a matter of principle, the design for a system or component should be done after its objectives and specification are done, and before its implementation. Similarly, the design for the components in a system should proceed top down, starting with the system as a whole and proceeding to lower and lower level components. When the design of one component depends on the design of another, the two should be designed together.

These principles often lead people to conclude that systems should be built using a waterfall-like process, where everything is specified before design, everything designed before implementation, and so on.

Real projects are not so simple. I have never observed a project that actually used such a process, even when it tried to. This is because no complex system I have encountered has been fully and accurately knowable in advance. One can write a set of specifications that turn out to require some impossible component design. One might miss some important system objective when developing the initial system concept because the customer was not able to conceive of system operation until they could see part of the system in operation, or because the customer’s needs change. An initial design may be invalidated because a supplier discontinues an essential part. Some part of the system may require significant investigation or research before one can find a feasible way to approach its design.

All of these situations lead to cases where the specification, design, and implementation of the system do not proceed in a tidy one-way sequence through the waterfall stages. Instead, part of a component’s specification gets worked out, and some tentative design goes ahead using that part of the specification. Or multiple possible design approaches are defined, and then someone proceeds to build simple prototype implementations of two or more of them to compare their feasibility. Or the design for a component must change, leading to a change in implementation. All of these may be happening in multiple parts of the system at once.

At the worst, all this change happening all over a system can lead to chaos where people working on different components are working to incompatible specifications or designs and building parts that will not integrate into a system. Project management may not be able to determine how much progress has actually been made on any part of the system, and thus be unable to detect when there are schedule or resource problems.

Therefore, while the simple waterfall model is not a feasible way to organize the work on a system, there is still a need to organize development work.

40.4.1 Applying general principles, flexibly

Develop specifications, then design. When one designs a component without first working out what the rest of the system needs that component to do, one usually ends up with a design that doesn’t actually meet needs (once those are worked out). When a specification gets developed, the people involved will tend to look at the effort that has already been spent on designing (and possibly implementing) the component and will try to adjust the specification to fit that sunk cost—after all, that work has already been done, why should it be discarded? Unfortunately this tends over time to produce safety and security problems, and to dramatically increase the cost of the system as people try to integrate the wrong component into the rest of the system.

It is better to explicitly defer some design decisions until the specification is firm—but not avoid doing any design. (Doing no design until specification is done is not possible when the design activity can reveal problems with a specification.) Do a minimal amount of design, bearing in mind the risk that design may need to change as the specification changes, as well as the risk that the specification may need to change as design reveals problems.

Instead:

Keep the design officially tentative as it is being developed, so that people continue to treat the design as a work in progress that they are willing to modify.
Limit the amount of design effort while a specification is incomplete, so that one can check whether the specification can lead to a feasible design while still keeping the design open as the specification is revised.
Focus design efforts either on those parts of the specification that appear to be fairly certain, or on those parts where there appears to be risk of specifying something that can’t be built.
Track how the design matches the specification as the two evolve, so that when the specification changes one can accurately determine what parts of the design are affected.
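Tracking how the design matches the specification can be as simple as a table that records which specification clauses each design element fulfills. The following minimal sketch assumes such a table exists; the clause identifiers, element names, and the `TRACE` mapping are hypothetical.

```python
# Hypothetical trace table: design element -> specification clauses it fulfills.
TRACE = {
    "battery_pack": {"SPEC-1", "SPEC-3"},
    "charge_controller": {"SPEC-2", "SPEC-3"},
    "enclosure": {"SPEC-4"},
}


def design_elements_for(changed_clauses, trace):
    """Design elements that must be re-examined when the given clauses change."""
    changed = set(changed_clauses)
    return sorted(elem for elem, clauses in trace.items() if clauses & changed)


def untraced_clauses(all_clauses, trace):
    """Specification clauses that no design element claims to fulfill yet."""
    covered = set().union(*trace.values()) if trace else set()
    return sorted(set(all_clauses) - covered)


affected = design_elements_for({"SPEC-3"}, TRACE)
missing = untraced_clauses(["SPEC-1", "SPEC-2", "SPEC-3", "SPEC-4", "SPEC-5"], TRACE)
print(affected)
print(missing)
```

The same table answers two questions: which parts of the design a specification change touches, and which parts of the specification the tentative design has not yet addressed.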

Develop design, then implement. Similar to the way design reflects specification, the implementation reflects design. Proceeding with implementing a component before it has been designed is not really possible: doing so means that design is done implicitly and is left unrecorded. This leads to components that fail to meet functional, safety, or security constraints because those constraints have not been properly considered and analyzed before committing effort to implementation.

At the same time, deferring all implementation until all design is complete is a recipe for an infeasible system. It is all too easy to create a design that involves impossible feats of implementation, from requiring metals that do not currently exist (“unobtainium”) to algorithms that have not been invented.

I have found that a middle ground often works well. As I will discuss in future chapters on implementation, I have used a software implementation approach that emphasizes continuous integration (by which I do not mean continuous testing) and skeleton building for implementation, where the implementation proceeds in many small iterations. Using this approach the team can build a simplified implementation of the general structure of a component, focusing on those aspects where the design appears either to be relatively certain or where there is higher risk in the design that needs to be checked with a rough implementation.

I have also made a point of prototyping implementations of parts of a design in order to validate whether the design is feasible. I will also discuss prototyping in a future chapter.

There is a high risk with any implementation done before specification and design are solid, even when the implementation is done for good reasons (like prototyping to validate a design approach). The effort spent on implementing something is a sunk cost: it cannot be recovered. As the design evolves, there is a strong incentive to try to continue to reuse the implementation that has already been completed, as the incremental cost or time of modification is almost always perceived to be less than starting a new implementation from scratch. This leads to a sequence of incremental changes, each of which by itself can be perceived as the lower-cost way of handling a design change. However, it is often the case that after a few of these incremental changes, it would have been more cost-effective to throw away the initial prototype or implementation and start over with better information. This sequence of incremental changes also tends to result in an implementation that has many vestiges of implementations that are no longer applicable, but which continue to present a source of bugs, security flaws, or safety problems.

The cost of incrementalism is often apparent only in retrospect. It is also driven by basic business imperatives to minimize cost at each step, or to get features implemented as rapidly as possible. This is an example of an online optimization problem, which is often hard to solve well theoretically and even harder when human incentives are involved. The techniques used to solve similar online optimization problems (notably the ski rental problem) apply. Limiting the amount of implementation effort that may be at risk for incrementalism by deferring as much implementation as possible until the design is solid helps avoid this situation.
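To make the ski rental analogy concrete, the following minimal sketch applies the classic break-even rule to the patch-or-rewrite decision: keep patching until the cumulative patching cost is about to reach the cost of a rewrite, then rewrite. The cost figures are hypothetical, and the model assumes (as the ski rental problem does) that a rewrite ends the stream of patch costs, which is a considerable simplification of a real project.

```python
def break_even_cost(num_changes, patch_cost, rewrite_cost):
    """Total cost of the online break-even rule over `num_changes` design changes.

    Patch each change until cumulative patch spending would reach the rewrite
    cost, then rewrite once. Assumes no further cost after the rewrite.
    """
    spent = 0
    for _ in range(num_changes):
        if spent + patch_cost >= rewrite_cost:
            return spent + rewrite_cost
        spent += patch_cost
    return spent


def hindsight_cost(num_changes, patch_cost, rewrite_cost):
    """Cheapest possible cost if the number of changes were known in advance."""
    return min(num_changes * patch_cost, rewrite_cost)


# Hypothetical figures: each patch costs 1 unit, a rewrite costs 10 units.
few = break_even_cost(5, 1, 10)
many = break_even_cost(100, 1, 10)
worst_ratio = max(
    break_even_cost(n, 1, 10) / hindsight_cost(n, 1, 10) for n in range(1, 200)
)
print(few, many, worst_ratio)
```

Under these assumptions the break-even rule never costs as much as twice the hindsight optimum, no matter how many changes eventually arrive. Always patching, by contrast, has no such bound: its cost keeps growing with the number of changes.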

Thus we:

Limit the implementation effort to building simple, low-effort skeletons or prototypes of a component implementation, rather than moving to building the implementation as a whole.
Deliberately make the initial skeletons or prototypes undeployable so that the temptation to use half-baked experimental implementation is reduced.
Focus the implementation effort on aspects of the design that either have low uncertainty, implying that the risk of having to re-implement is low, or where the design carries high technical uncertainty, implying that the value of a prototype feeding back information to the design is high.
Track the relationship between design and implementation, so that the implementation can accurately change as the design evolves.

Design top down and coordinate the design of interdependent components. Many aspects of a system’s design can only be developed effectively when they are developed from the top down, notably safety and security properties. That is because these properties apply to the system as a whole and are emergent from the designs of the components that make up the system. (See [Leveson11] for an in-depth discussion of this effect.)

However, designing from the top down creates risk, similar to the previous principles, that a high-level design may create unachievable specifications for lower-level components. There is also a risk that during high-level design the cost or time involved in developing some lower-level parts of the system is unknown. This can lead to effort being spent, unknowingly, on subcomponents that are simple to design and build while subcomponents that will take far longer to develop are left for later, leading to a drawn-out schedule.

Our recommendation for managing this risk is to sketch the design for multiple layers, creating a rough outline of a design for a component and some layers of its subcomponents, then proceeding to add detail to the high-level component and fleshing out the specification for its subcomponents. Proceeding incrementally in this way allows one to obtain some information about the feasibility and complexity of a particular design approach before committing all of one’s effort to the detail of the top-level component. This approach is similar to our recommended implementation approach of building skeletons or prototypes of components rather than immediately progressing to detailed implementation.

The same issues about the cost of incrementalism apply to top-down design as they do to implementation. It can be useful to make sketch designs that are not in the final form needed, to reduce the temptation to turn sketches that have been changed over and over directly into the design for a component.

40.4.2 Additional principles

Balance design work. I have found that focusing on one aspect of a component’s design to the exclusion of others often leads to dead-end designs, where a work in progress becomes too biased toward one aspect and is not readily evolved as other aspects begin to be considered. Focusing on primary features first, and leaving security or safety for later, is a common example of this pattern.

I have found it more useful to consider many different aspects of a component’s design at a high level, sketching out different rough possible designs and making simple comparisons as one learns about the design problem. This approach has the advantage of investing relatively less effort on detail design and analysis while the design has a higher degree of uncertainty, and focusing effort on those approaches that pass the first simple evaluations.

This approach to design has its pitfalls. Some components’ designs are constrained by particular aspects—such as a need for high performance or the ability to operate in an extreme environment. These aspects are sometimes called design drivers: they have a disproportionate effect on the final design. Recognizing when some aspect drives the design in this way, and putting more effort earlier into understanding these drivers, is part of the art of designing well.

Plan for updates. Nearly every design in a successful system will be updated as time goes by. Over time, the effort spent on these updates will dwarf the effort spent on the initial design. This means that if one is developing a system for the long run, the processes, tools, and artifacts used in the design effort should be organized in a way that supports those who will come along to learn about, evaluate, and redesign parts of the system—long after those who initially designed it have moved on.

This necessitates documenting more than just the structure of the implementation. For these people to understand a design, they need to know the thinking behind the choices and the subtle aspects that are not necessarily apparent from looking at the implementation. These people will need guidance for how components relate to each other. They will need to understand the analyses that determined whether the component’s design was sufficiently safe or secure. This documentation takes more effort than proceeding through a one-time design, building an implementation, and then moving on, but it provides a project with a future.

Making updates effective also involves creating a team structure and human processes that can handle updates. This involves giving the team a clear way to understand how design changes happen, how to distinguish proposals or work in progress from a design they should work from, and how to determine what design applies to a specific deployed system. It also involves developing a team culture that incentivizes good design and good documentation: giving people enough time to document enough of the design that their successors can build on their work, and avoiding unnecessary time pressures that disincentivize people from doing good design.

Use appropriate infrastructure. Finally, effective design relies on having the tools, processes, and standards that give people the means to do design work. The key principles I recommend include:

All designs should be maintained in a single repository, so that everyone on a project knows where to access the information and everyone works from the same designs. This is often referred to as having a “single source of truth”. As noted earlier, this is not always possible when some tools will not integrate with others; in that case, each particular kind of design information should be maintained in one place and there should be clear linkages between a document in one repository and a related one in a different repository.
All design information should be versioned, so that everyone on the project can determine whether they are using the same version of a particular design artifact. Versions should be for more than just single artifacts; they should define a version of the entire system, so that the artifacts in one version are consistent with each other.
Design artifact versions should make it clear when a version is a proposal or work in progress, a version being reviewed, a version that has been baselined (meaning approved for others in the team to use), a version that applies to a deployed or released system, or a version that has been superseded. There should be a common standard for how these different kinds of versions are related to each other and labeled.
Design artifacts should use common formats or design standards. This includes a common format for the basic information about a component’s design, common formats for drawings (such as using SysML/UML, or CAD standards), or common formats for reporting analyses.

Chapter 41: Breakdown structure

The component breakdown, or breakdown structure, is the way to name and organize all the components that make up a system.

41.1 What is the component breakdown for?

The component breakdown organizes and names all the pieces in the system. It serves three main purposes:

41.1.1 Component breakdown versus work breakdown

Some institutions, notably NASA [NPR7120][NASA18] and other parts of the US Federal government [DOD22], specify the use of a work breakdown structure (WBS) in project management and systems engineering. A WBS as used in those projects is different from a component breakdown structure as defined here.

A WBS is oriented toward project management, not systems engineering. It is focused on defining the work to be done (hence the name) rather than the items or components being built by the work. From the NASA WBS Handbook [NASA18, p. 35]:

Other project management methodologies define a work breakdown structure as, in effect, a checklist of the kinds of work that may be required for a system, feature, or component. McConnell discusses using a generic work breakdown structure in estimation to ensure all the effort involved is accounted for [McConnell09, Table 10-3].

This difference in intent leads to two major differences in the contents of a WBS compared to a component breakdown. The first is that a WBS includes work items that are not product artifacts. The standard NASA WBS, for example, includes project management, systems engineering, and education and public outreach branches of the work breakdown tree [NASA18, p. 47]. Given that part of the goal of the WBS is to organize resources and budget for a project, that’s an appropriate choice. The other difference is that some people break a task for building a component down into multiple revisions or releases. For example, a “motor control software” component might have subitems “prototype”, “release 1”, and “release 2”, recording the phases of work done to develop that software package.

The component breakdown structure presented in this chapter is narrower in focus than a WBS. The component breakdown lists only the things that are being built. It must be complemented by other engineering and management artifacts to provide everything needed to run a project.

41.1.2 Component breakdown versus other views

The component breakdown is one of several views into the system’s design and specification. The component breakdown has only two purposes: listing all the components and giving them unique names, and providing a structure that people can use to navigate through the components to find one they are looking for.

The component breakdown is not for expressing other facts about components and relationships between them. There are other views and other breakdowns for representing that information—and for doing so in ways that are better suited to the specific information that needs to be explained. For example, a network or wiring diagram does a better job of illustrating how multiple hardware components are connected together. Mechanical drawings are a better way to show how components relate to each other physically. Data and control flow diagrams, perhaps realized as SysML activity and sequence diagrams, are better suited to expressing relationships between software components.

41.2 Basic concepts

When developing a component breakdown, the first question to be settled is: what is a component?

First, a component is something that people think of as a unit. Terms like “system”, “subsystem”, or “module” are all clues that people think of a thing as a unit. More generally, a component is something

Components do not have to be atomic units. Systems have subsystems; components have subcomponents. For example, the electrical power system (EPS) in a spacecraft is a medium-level component in a typical breakdown structure. It is part of the spacecraft as a whole. It is made up of several subcomponents: power generation, power storage, power distribution, and power system control. Each of those subcomponents in turn has constituent components of its own: for example, power generation has solar cells, perhaps arrays that hold the cells, perhaps some other power generation mechanism.

This illustrates the general pattern for the breakdown structure. The structure is a tree, with the highest-level component being the system as a whole. The system as a whole is typically not just a vehicle or box; it is the entire mission or business of which a vehicle is a part. Underneath the whole system come the major component systems. For a spacecraft mission, this might be the spacecraft, ground systems, launch systems, and related assembly and test systems. The next level consists of the major subsystems. The structure continues recursively until it reaches the smallest components that are sensible to model using systems tools.

The recursive process of defining smaller and smaller components ends when there is a judgment that further subdivision won’t help the systems engineering process. In practice, for example, continuing the breakdown structure all the way to individual resistors and capacitors on a printed circuit board is too detailed to be useful for systems engineering tasks.

Some criteria I have used for deciding when to continue subdividing a component into subcomponents include:

41.2.1 Satisfying the objectives

41.2.2 Alternatives

The approach laid out here is fundamentally hierarchical, and reflects the way people usually approach breaking down a complex system—by a reductive approach that organizes parts into a hierarchy.

That is not the only approach to organizing the components. Mechanical and electrical engineering systems often use a more-or-less flat space of part numbers to identify components. The specifications for each part can have attributes, and the attributes allow one to search for a desired part.

A flat part number approach works well for low-level, physical components. A 100 ohm resistor can be used in many different components; there is little value in giving a different name for its use in one place on one board and a different name for a second place on that board, or on a different board. Similarly, when manufacturing many instances of a vehicle, using a part number to identify the part in an assembly works well.

I have generally not used a part number approach for higher-level systems activities, however, because the uses are not the same. During design, each component that systems engineering deals with is generally unique.

41.3 Component identifiers

A component’s identifier provides a unique way to refer to that component. It is like the address for a building: it allows one to find the component (or its specifications), but does not by itself convey much more information. The keys are that the identifier be unique, and that people can use the identifier to find what they are looking for.

The pathname is the long-standing practice for creating identifiers for elements in a hierarchy. This is familiar from file systems and URLs: the path /a/b/c/d refers to a file or object named “d”, which is contained in “c”, which is in turn contained in “b”, which is part of “a”, which is one of the top-level objects or folders in the system. While the object name “d” is not necessarily unique (there can be another object /a/f/d, for example), the path as a whole does give a unique identifier for the object or file.

This approach applies to the identifiers for components in a breakdown structure as well. The names in the path are typically separated by a slash (/) or period (.).
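As a minimal sketch of the idea, the following Python builds path identifiers by walking a breakdown tree. The `Component` class and the `path_identifiers` function are hypothetical names chosen for illustration, and the period separator is just one of the two conventions mentioned above.

```python
from dataclasses import dataclass, field


@dataclass
class Component:
    """One node in a breakdown tree (hypothetical class for illustration)."""
    name: str
    children: list = field(default_factory=list)


def path_identifiers(root, sep="."):
    """Return the full path identifier of every component, parents first."""
    ids = []

    def walk(node, prefix):
        ident = node.name if not prefix else prefix + sep + node.name
        ids.append(ident)
        seen = set()
        for child in node.children:
            # Sibling names must differ, or two paths would collide.
            if child.name in seen:
                raise ValueError(f"duplicate name {child.name!r} under {ident}")
            seen.add(child.name)
            walk(child, ident)

    walk(root, "")
    return ids


sc = Component("sc", [
    Component("eps", [Component("batt")]),
    Component("cdh", [Component("fp")]),
])
print(path_identifiers(sc))
```

The only uniqueness rule the sketch enforces is that sibling names differ. That is enough to make every full path unique, even when the same short name appears in different branches.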

The names of each component in the tree can be abbreviations or short words describing the component. Both work well; the choice is primarily a matter of style. When there are commonly used abbreviations for some components, it is reasonable to mix and match abbreviations and longer names. For example, a spacecraft’s computing system is often called the CDH (command and data handling); attitude control is the ACS (attitude control system); and the electrical system is the EPS (electrical power system).

Long component identifiers can become a problem. Long identifiers are harder to type than shorter ones. Sometimes there are limits on how long an identifier can be; for example, if one is recording information about components in a spreadsheet and putting each different component on a different sheet, most spreadsheet packages have a limit on how long a sheet name can be.

The length of an identifier is driven by how deeply the breakdown structure tree goes. The path name for a component six layers down in the hierarchy will be much longer than the path name for a component in the third layer. This suggests that one should try not to make the component hierarchy any deeper than it needs to be.

Abbreviations	Short names
sc	spacecraft
sc.eps	spacecraft.power
sc.eps.batt	spacecraft.power.battery
sc.cdh.fp	spacecraft.cdh.flightprocessor
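A project could check identifier length and depth automatically. The sketch below assumes a limit of 31 characters purely for illustration; the real limit depends on the tools in use. The last identifier in the sample list is hypothetical, added to show what happens one level deeper than the table above.

```python
MAX_LEN = 31  # assumed limit for illustration; check the tool actually in use


def too_long(identifiers, max_len=MAX_LEN):
    """Return the identifiers whose length exceeds max_len."""
    return [i for i in identifiers if len(i) > max_len]


def depth(identifier, sep="."):
    """Number of levels in a path identifier."""
    return len(identifier.split(sep))


ids = [
    "sc.eps.batt",
    "spacecraft.power.battery",
    "spacecraft.cdh.flightprocessor",
    "spacecraft.cdh.flightprocessor.software",  # hypothetical fourth level
]

for ident in ids:
    print(depth(ident), len(ident), ident)
print("over the limit:", too_long(ids))
```

With short names, the third-level identifiers still fit under the assumed limit, but one more level pushes the identifier past it. The abbreviated form leaves much more room.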

41.4 Viewing the breakdown structure

Many people find a visual representation of the breakdown structure helpful for understanding it. Here is a drawing of an incomplete breakdown structure for a simple spacecraft:

It is worth finding tools that can show this kind of visual representation of the breakdown structure.
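Even without a drawing tool, a list of path identifiers can be turned into an indented outline with a few lines of code. This is a minimal sketch, not a recommendation of any particular tool; `render_tree` is a hypothetical helper, and the identifiers are a small subset in the style used later in this chapter.

```python
def render_tree(identifiers, sep="."):
    """Return an indented text outline, one line per component."""
    lines = []
    # Sort by path segments so that children directly follow their parent.
    for ident in sorted(identifiers, key=lambda i: i.split(sep)):
        parts = ident.split(sep)
        lines.append("  " * (len(parts) - 1) + parts[-1])
    return "\n".join(lines)


ids = [
    "space",
    "space.eps",
    "space.eps.battery",
    "space.eps.panels",
    "space.cdh",
    "space.cdh.main",
]
print(render_tree(ids))
```

Sorting on the split path rather than on the raw string keeps each subtree contiguous even when names contain characters, such as hyphens, that sort before the separator.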

41.5 Context and relationships

The breakdown structure provides the fundamental organization for most systems engineering artifacts. This means that the structure chosen for the breakdown will affect how most other parts of a specification are organized.

Each component named in the breakdown has a specification. The specification includes information like

When two components interact, the interface between them must name which components are involved. The specifications for each component must indicate what data or control they will be sending and receiving in the interaction.

The identifier for a component provides a way to express a reference between implementation and test artifacts, like source code or drawings, and the specifications to which they should comply.
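Because interfaces and other artifacts refer to components by identifier, a simple consistency check is to confirm that every identifier they use actually exists in the breakdown. The sketch below assumes interfaces are recorded as pairs of identifiers; that record format and the identifier `space.eps.heater` are hypothetical, included only to show a dangling reference.

```python
# Identifiers defined in the breakdown structure (a small illustrative subset).
components = {"space.eps.controller", "space.eps.battery", "space.cdh.main"}

# Hypothetical interface records: each names the two components involved.
interfaces = [
    ("space.eps.controller", "space.eps.battery"),
    ("space.eps.controller", "space.cdh.main"),
    ("space.eps.controller", "space.eps.heater"),  # not in the breakdown
]


def dangling_references(components, interfaces):
    """Return identifiers used in interfaces but absent from the breakdown."""
    used = {ident for pair in interfaces for ident in pair}
    return sorted(used - components)


print(dangling_references(components, interfaces))
```

The same check applies to references from source code, drawings, or test artifacts, as long as each reference is recorded as a component identifier.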

The breakdown structure affects almost everyone working on the project. This includes:

41.6 Advice

41.6.1 Evolution

The understanding of the system evolves gradually from the initial concept to the time that a final product is delivered (if indeed there is a final product). At each step of this evolution, the understanding of what should be in the breakdown structure and how it should be organized will change.

Because the breakdown structure is central to many other processes and artifacts, a change to the breakdown structure will result in changes to potentially many other artifacts. The cost of the change grows as the size of the breakdown structure tree grows.

Don’t try to build an elaborate and complete breakdown structure too early. At the beginning, while still working out the basic concepts of the system and its structure, just sketch out the first level of the structure—and try out several potential structures until one appears to match the system’s objectives. Often the main structure will be suggested by common practice for similar projects: the automobile industry has a common, vernacular breakdown of cars and trucks into common subsystems, for example.

In general, it is best to keep a branch of the breakdown structure shallow as long as there is significant uncertainty about how that part of the system will be designed. In an aircraft, for example, the propulsion system should be left unrefined in the breakdown structure until the team has settled on the general approach to propulsion—will it use turbofans, turboprops, propfans, electric rotors, or some combination? The broad choice can typically be settled early in concept development by working out the concept of operations and determining what capabilities, performance, and physical layout will meet the aircraft’s operational needs. Once the general architecture has been decided, then one can refine the propulsion system by adding a layer of components for each engine or other major unit involved in propulsion.

41.6.2 Depth

The point of the breakdown structure is to help people find and refer to components. The breakdown structure should reflect common ideas of how a system breaks down into components, and should result in short, easy-to-use identifiers. The breakdown structure should focus on these capabilities and not be drafted into serving other purposes.

Consider the breakdown structure for all the sensors that provide information to an autonomous vehicle. One way to organize the sensors is to create a general “sensors” component, and then include all the sensors as children of the general sensors component. Another way is to break the sensors down first by general type (camera, lidar, radar, sonar, microphone), then by general location of the sensor on the vehicle (front, left, right, top, back), and then by the specific sensor unit. The first approach leads to a shallow and broad breakdown structure; the second leads to a narrow and deep structure.

In general, a shallow, broad breakdown structure will meet these objectives better than a narrow and deep structure. There are a few reasons for this.

This leads to a general principle. The breakdown structure should be used only for providing a unique name, and not for embedding a taxonomy or search attributes. The tools that people use to navigate through the breakdown structure and its related artifacts, like specifications, should provide search mechanisms that let someone find a component by attributes. Embedding extraneous information, like a location attribute or model number or power requirement, in the name will just make the names longer, harder to use, and less resilient to change.
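One way to picture this separation is to keep the identifier short and store type, location, and similar facts as attributes that a tool can search. The sketch below is illustrative only: the `Component` class, the `find` helper, the identifiers, and the attribute names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Component:
    """A component with a short identifier and searchable attributes."""
    ident: str
    attributes: dict = field(default_factory=dict)


def find(components, **criteria):
    """Return identifiers of components whose attributes match all criteria."""
    return [
        c.ident
        for c in components
        if all(c.attributes.get(key) == value for key, value in criteria.items())
    ]


# A shallow, broad structure: every sensor is a direct child of "sensors".
sensors = [
    Component("vehicle.sensors.cam1", {"type": "camera", "location": "front"}),
    Component("vehicle.sensors.cam2", {"type": "camera", "location": "back"}),
    Component("vehicle.sensors.lidar1", {"type": "lidar", "location": "top"}),
]

print(find(sensors, type="camera"))
print(find(sensors, type="camera", location="front"))
```

If a camera is later moved from the front to the top of the vehicle, only its `location` attribute changes. Its identifier, and every artifact that refers to it, stays the same.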

41.6.3 Multiple fit

The hierarchical, tree-structured approach recommended here makes each component part of exactly one parent component. It does not accommodate components that have more than one natural affinity to parent groupings.

Consider a radio transceiver that is used to communicate between aircraft, such as the ADS-B systems used for collision avoidance. This transceiver could be categorized multiple ways. It is part of the aircraft, but it is also part of an air traffic management safety system. The transceiver within the aircraft is part of a communication system, but it is also a part of the flight control system and intimately connected with human interface components on the flight deck. The transceiver, in other words, is part of several different groupings of components, depending on who is looking and for what purpose.

There is a fundamental tension between simple organizing structures, like a tree, and the richer relationships that elements of a system have with each other. For an excellent discussion of this, see Alexander’s essay on trees as a structuring approach for cities [Alexander15]. In that essay, Alexander proposes that a lattice structure is a more appropriate model for organizing urban structures. In his account, a tree-oriented description of a city fails to account for the ways that a house can be both a place for a family to live as well as a node in a social network and a place of work; in each of these roles, the house is related to different buildings or locations in the city.

The systems engineering approach presented here addresses this problem by separating naming or identity from the complex relationships that each component actually has. The breakdown structure only tries to give a name to each thing, like the address for a building. The relationships, functions, requirements, and everything else that goes into defining a component are all left to other artifacts, such as the component’s specification and models of the components.

This means: don’t try to make the breakdown structure do too much. When a component fits into multiple categories, pick the one that seems most natural for most users and leave it at that. Other artifacts and tools will address greater complexity.

41.6.4 Not by function

The breakdown structure is for organizing components: things that are built and that can be seen or touched (possibly virtually).

There is sometimes a temptation to try to organize system functions into the breakdown hierarchy. Don’t do that. The breakdown of function—and of the allocation of function to component—is a separate task that needs to be addressed by a structure that focuses on how functions are organized.

A better approach is to maintain the component breakdown and a functional breakdown separately, and maintain an allocation mapping that shows how different subfunctions are achieved by different components. The functional breakdown is often better reflected in the structure of how specifications or requirements derive from each other. See the chapter on requirements for more on this.
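A minimal sketch of such an allocation mapping follows. The function names (`power.regulate` and so on) are hypothetical, as is the choice to represent the mapping as a dictionary from function to component identifiers; real projects typically keep this in a requirements or modeling tool.

```python
from collections import defaultdict

# Hypothetical allocation: each function maps to the components that achieve it.
allocation = {
    "power.regulate": ["space.eps.controller"],
    "power.store": ["space.eps.battery"],
    "power.report": ["space.eps.controller", "space.cdh.main"],
}


def invert(allocation):
    """Return a mapping from component identifier to the functions it supports."""
    by_component = defaultdict(list)
    for function, components in allocation.items():
        for component in components:
            by_component[component].append(function)
    return dict(by_component)


def unallocated(functions, allocation):
    """Return functions that no component has been assigned to."""
    return [f for f in functions if not allocation.get(f)]


print(invert(allocation)["space.eps.controller"])
print(unallocated(["power.regulate", "power.shed"], allocation))
```

Keeping the two breakdowns separate means either one can be reorganized without renaming anything in the other; only the mapping needs to be updated.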

41.6.5 Keep related things together

Some projects have proposed organizing components primarily by some fundamental, nonfunctional attribute. One project was considering separating hardware from electronics from software from operational procedures at the top level, and then organizing components within each of those categories by subsystem. Another project organized components first by the vendor organization that was to implement the component.

These approaches make it harder for people to use the breakdown structure to find things. Consider an electrical power controller on a spacecraft. This has an electronic component (the board and processor that runs the power control function) and a software component (that makes the decisions about what to power on and off, and to report information to a telemetry function). Someone working on the power controller will generally want to know about both aspects. Requiring them to look in two widely-separated parts of the breakdown structure is inconvenient, and (more seriously) it increases the chances that someone will miss a component that they need to know about to do their work.

As a general principle, it is better to group components by how people naturally think of them as being grouped. Keep functionally-related components close together in the breakdown structure so that people can find everything they need about something by looking in one place.

As noted above, this doesn’t always work. The breakdown structure will not be perfect because not everything in a system naturally falls into a hierarchical organization. But the more that like things can be grouped, the easier it will be for people.

41.6.6 Generic and reusable components

There is one special case of a component fitting into multiple places in a breakdown structure that deserves special treatment: generic and reusable components.

Consider an operating system. There may be multiple processors within a system that may all run instances of the same operating system. It is useful to have one specification for that operating system: there’s one product that is acquired from a vendor, there is one master copy kept somewhere, and so on. At the same time, that operating system will be loaded onto many different processor components in different subsystems.

One way to address this is to have a part of the breakdown structure for generic components, and then put an instance of that component in the places where it is used. The specification of each instance component can refer to the specification for the generic, with those functions or requirements that are specific to the instance added. This is an example of using the class-instance model from object-oriented programming to solve the problem.
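The class-instance idea can be sketched as follows. The `Spec` class, the identifiers `generic.os` and `space.cdh.main.os`, and the requirement texts are hypothetical; the point is only that an instance specification refers to the generic one and adds its own requirements.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Spec:
    """A component specification that may refer to a generic specification."""
    ident: str
    requirements: list = field(default_factory=list)
    generic: Optional["Spec"] = None  # the generic spec this instance refers to

    def all_requirements(self):
        """Requirements from the generic spec, followed by instance-specific ones."""
        inherited = self.generic.all_requirements() if self.generic else []
        return inherited + self.requirements


# One generic specification, kept in a "generic" branch of the breakdown.
generic_os = Spec("generic.os", ["provide preemptive task scheduling"])

# One instance, placed where the operating system is actually used.
main_os = Spec(
    "space.cdh.main.os",
    ["start within the main processor's boot time budget"],
    generic=generic_os,
)

print(main_os.all_requirements())
```

A change to the generic specification is then visible in every instance, while each instance keeps its own identifier in the part of the tree where it is used.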

41.7 Examples

41.7.1 NASA Work Breakdown Structure

The NASA project management process and systems engineering standards use a common WBS structure across all NASA projects. The use of the WBS is codified in a Procedural Requirement document [NPR7120], with details in an accompanying handbook [NASA18].

The NASA WBS is used as a project management artifact to organize work tasks, resources and budget, and report progress. The hierarchy must “support cost and schedule allocation down to a work package level” [NPR7120, p. 113]. A “work package” means one task or work assignment that is tracked, budgeted, and assigned as a single unit.

A NASA project’s WBS tree is rooted in the official NASA project authorization, with its associated project code.

The first level of elements is defined by NASA standards, and each element has a standard numbering. The standard elements for a space flight project are [NPR7120, Fig. H-2, p. 113]:

Note how this organization mixes technical artifacts (payloads, spacecraft, ground systems) and management activities (project management, safety and mission assurance, public outreach).

The NASA WBS is intended to be one part of an overall project plan document. The project plan also contains information like:

41.7.2 MIL-STD-881 Work Breakdown Structure

This breakdown structure standard aims to provide a “consistent and visible framework” [DOD22] for communicating and contracting between a government program manager and contractors that perform the work. It addresses needs such as “performance, cost, schedule, risk, budget, and contractual” issues [DOD22, p. 1]. This kind of WBS is thus focused on supporting contractual relationships with suppliers.

The standard defines a number of different templates for different kinds of projects. It includes templates for aircraft systems, space systems, unmanned maritime systems, missiles, and several others.

As should be clear from this example, this WBS template aims to address not just the design and building of a system but rather the operation of the entire program, including testing, deployment, and initial operation.

41.7.3 A simple spacecraft system

This is an example component breakdown for a simplified imaging spacecraft. The spacecraft uses solar panels to collect energy; it has a single imaging camera to collect mission data; it has a flight computer to run the system; an attitude control system to point the imager where needed; and a radio to communicate to ground. (The graphical version of this breakdown structure is included earlier in this chapter.)

This example only goes four levels deep. The actual breakdown structure would likely include at least two more levels, to represent, for example, different parts of the flight control software or subcomponents of the radio transceiver.

The example includes an example of a component that could fit in multiple places in the structure: the propellant tank heater. This is part of the thermal management system—its function is to keep the fuel in the propellant tank within a certain temperature range—but it is also part of the propulsion system. In this example the choice was to categorize it as part of the thermal management system.

Id	Title
space	Space segment
space.acs	Attitude control system
space.acs.control	Control logic
space.acs.sun	Sun sensor
space.acs.wheels	Reaction wheels
space.cdh	Command and data handling avionics
space.cdh.gps	GPS receiver
space.cdh.gps.ant	Antenna
space.cdh.main	Main processor
space.cdh.storage	Data storage
space.comm	Communications system
space.comm.ant	Antenna
space.comm.ant-tran	Cable
space.comm.trans	Transceiver
space.eps	Electrical power system
space.eps.battery	Battery
space.eps.controller	Power controller
space.eps.panels	Solar panels
space.eps.sep	Separation switch
space.harness	Harnesses
space.harness.canbus	Data CAN bus
space.harness.pl	Payload harness
space.harness.power	Power cabling
space.harness.radio	Radio harness
space.pl	Payloads
space.pl.imager	Imager payload
space.prop	Propulsion system
space.prop.lines	Fuel lines
space.prop.tank	Fuel tank
space.prop.tank.pressure	Pressurization system
space.prop.tank.sensor	Fuel pressure sensor
space.prop.thruster	Thruster
space.structure	Structure
space.thermal	Thermal management system
space.thermal.propheat	Prop tank heater
space.thermal.radiator	Thermal radiator
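A breakdown kept as a flat list of identifiers like this one is easy to check mechanically. The sketch below copies the identifiers from the table above and checks two properties: every component's parent is itself listed, and the tree is no deeper than stated. The helper names are hypothetical.

```python
IDS = """
space space.acs space.acs.control space.acs.sun space.acs.wheels
space.cdh space.cdh.gps space.cdh.gps.ant space.cdh.main space.cdh.storage
space.comm space.comm.ant space.comm.ant-tran space.comm.trans
space.eps space.eps.battery space.eps.controller space.eps.panels space.eps.sep
space.harness space.harness.canbus space.harness.pl space.harness.power
space.harness.radio
space.pl space.pl.imager
space.prop space.prop.lines space.prop.tank space.prop.tank.pressure
space.prop.tank.sensor space.prop.thruster
space.structure
space.thermal space.thermal.propheat space.thermal.radiator
""".split()


def missing_parents(ids, sep="."):
    """Return identifiers whose parent path is not itself in the list."""
    known = set(ids)
    return sorted(i for i in ids if sep in i and i.rsplit(sep, 1)[0] not in known)


def max_depth(ids, sep="."):
    """Return the number of levels in the deepest identifier."""
    return max(len(i.split(sep)) for i in ids)


print("missing parents:", missing_parents(IDS))
print("maximum depth:", max_depth(IDS))
```

Checks like these are cheap to run whenever the breakdown changes, which helps keep the identifiers trustworthy as the structure evolves.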

[ARP4754]	Guidelines for Development of Civil Aircraft and Systems. SAE International, Standard 4754 rev. A, December 2010.
[Alexander15]	Christopher Alexander. A City is not a Tree. Sustasis Press, Portland, Oregon, 2015.
[Asimov50]	Isaac Asimov. I, Robot. Gnome Press, New York, 1950.
[BCP14]	Scott Bradner. Key words for use in RFCs to indicate requirement levels. Internet Engineering Task Force (IETF), Best Current Practice BCP 14, March 1997. https://www.ietf.org/rfc/bcp/bcp14.html.
[Cubesat22]	CubeSat Design Specification. The CubeSat Program, Cal Poly SLO, San Luis Obispo, CA, Standard Rev. 14.1, February 2022. https://www.cubesat.org/s/CDS-REV14_1-2022-02-09.pdf.
[DOD10]	DoD Deputy Chief Information Officer. DoD Architecture Framework Version 2.02. Department of Defense, United States Government, August 2010. https://dodcio.defense.gov/Library/DoD-Architecture-Framework/.
[DOD22]	Work Breakdown Structures for Defense Materiel Items. Department of Defense, United States Government, Standard Practice MIL-STD-881F, May 2022. https://cade.osd.mil/Content/cade/files/coplan/MIL-STD-881F_Final.pdf.
[FAA11]	System safety analysis and assessment for Part 23 Airplanes. US Department of Transportation, Federal Aviation Administration, Advisory Circular 23.1309-1E, November 2011.
[ISO26262]	Road vehicles — Functional safety. International Organization for Standardization, Geneva, Switzerland, Standard ISO 26262:2018, Second ed., 2018.
[Leveson11]	Nancy G. Leveson. Engineering a safer world: systems thinking applied to safety. Engineering Systems. MIT Press, Cambridge, Massachusetts, 2011.
[Lutz14]	Bob Lutz. How bad cars happen: the Pontiac Aztek debacle. Road & Track, 10 October 2014. https://www.roadandtrack.com/car-culture/a6357/bob-lutz-tells-the-inside-story-of-the-pontiac-aztek-debacle.
[McConnell09]	Steve McConnell. Software Estimation: Demystifying the Black Art. Microsoft Press, Redmond, Washington, 2009.
[McNamara22]	Paul McNamara and Frederick Van De Putte. Deontic Logic. In The Stanford Encyclopedia of Philosophy. Edward N. Zalta and Uri Nodelman, editors. Metaphysics Research Lab, Stanford University, Fall 2022 ed., 2022. https://plato.stanford.edu/archives/fall2022/entries/logic-deontic, accessed 3 February 2025.
[NASA16]	NASA Systems Engineering Handbook. National Aeronautics and Space Administration (NASA), Report NASA SP-2016-6105 Rev2, 2016.
[NASA18]	NASA Work Breakdown Structure (WBS) Handbook. National Aeronautics and Space Administration (NASA), Handbook SP-2016-3404/REV1, 2018. https://essp.larc.nasa.gov/EVM-3/pdf_files/NASA_WBS_Handbook_20180000844.pdf.
[NPR7120]	NASA Space Flight Program and Project Management Requirements. National Aeronautics and Space Administration (NASA), NASA Procedural Requirement NPR 7120.5F, 2021.
[Polya57]	George Polya. How To Solve It. Doubleday & Company, Second ed., 1957.
[Thompson87]	R. G. Thompson. Dash 80: The story of the prototype 707. Smithsonian Magazine, 30 April 1987. https://www.smithsonianmag.com/air-space-magazine/dash-80-81791575/.