Antman writes software

Reliable HTTP: Outsmarting the Two Generals with Webhooks

Anthony Manning-Franklin — Mon, 15 Apr 2024 01:57:36 GMT

The Two Generals Problem is a mathematical theorem proving that no messaging protocol can reliably ensure that two parties share the same state.

However, some approaches guarantee that two distributed systems will follow an acceptable state progression over time. That is, state changes progress linearly and deterministically. It either halts or continues but never enters an unrecoverable state, e.g., missing one message in a series of messages.

One of the most common sources of problems in applications is the misuse of HTTP in machine-to-machine communication. In this article, we will examine system design patterns that significantly improve the reliability of HTTP communication between systems.

The first step to solving these problems is understanding the distinction between a delivery guarantee provided by message producers and a processing guarantee provided by message consumers.

Guarantees come in three flavours:

Atmost-once: 0 or 1
Atleast-once: 1 or more
Exactly-once: 1

Delivery guarantees are either "atmost-once" or "atleast-once".
A processing guarantee could be any one of the three, depending on the delivery guarantee provided by the producer.

Next, we need to understand that every HTTP request/response cycle is TWO messages over one connection:

The connection opens
The requester writes their message
The responder writes a reply
The connection closes

Caveat: Steps 1 & 4

The Fundamental Problems of Software

Anthony Manning-Franklin — Sun, 26 Nov 2023 03:31:09 GMT

As far as I can tell, there are six immutable fundamental problems faced by all commercial software.

Identifying the correct problem to solve
Getting the right specifications to solve the problem
Distributing a shared understanding of the specifications
Implementing the specifications
Verifying correct implementation
Repeating the above process in the context of refining the problem and specifications over time

The hardest problems are 1, 2, 3, and 6.

#1: Problem Discovery & Definition

Identifying the correct problem to solve is difficult because human psychology is hard. No matter the nature of the problem, someone only wants us to solve it because of how the person or people purchasing the solution expects that solution will make them feel.

I dont care what problem youre solving, at the end of the day youre working on it because of how one or more people feel about it. But Anthony, Im writing software for NASAs next Mars rover, you say. Sorry, but we only want it because humans feel curious about the world and perhaps want some greater sense of mastery over an indifferent universe.

Every software projects greatest risk is a lack of sufficient value. If what we write doesnt meet a need sufficient for someone to part with their money, our software wont be run and we probably wont be paid to write it much longer.

#2: Solution Discovery & Design

The next most challenging aspect of a software project is actually figuring out how to solve the problem. This solution needs to be fit for purpose in multiple dimensions.

It needs to

be understood and usable for its users (those pesky humans and their psychology again!)
be cost-effective for the purchaser.
be economically viable for the business providing it.
meet the requirements of customer support teams.
meet legal and regulatory requirements.
be feasible to implement.
meet the performance, reliability, and durability expectations of its users.
meet the maintainability, changeability, testability, and observability requirements of the team working on it over the life of the service (human psychology strikes again!)
meet the financial reporting requirements of the business.
meet the business intelligence needs of the business.

All aspects of this multidimensional fit need to be considered across many different time scales. What race condition could occur if two actors issue a command within a few hundred milliseconds? How many requests per minute can it handle? How many months until the database needs to scale? How many years ago was this piece of code last changed?

As humans, we tend to solve problems by thinking in 2D-dimensional Euclidean space. If we can draw it on a screen or a page, we can expand our thought process beyond the limited working memory of our brains. And we can even share it with others. But even if we drew sequence diagrams, entity relationship diagrams, process maps, component diagrams, deployment diagrams, infrastructure diagrams, network diagrams, state machine diagrams, interaction overview diagrams, or even the C4 diagrams system context, container, component, and code we still would not be able to verify we have a correct and valuable solution until we run code for human use.

And this brings us to the heart of the third fundamental problem of building software.

#3: Shared Understanding

Not only is it effectively impossible to perfectly share the correct specification with a team for implementation, but we cant know how correct our specification is until after weve built it!

So the specification will change while were implementing it, and every person implementing it will have a slightly different understanding of what the specification even is. Worse still, every person will have a slightly different understanding of the problem the specification is supposed to solve.

That means that while finding the correct problem and correct solution is a matter of human psychology, creating software is a problem of organisational psychology! Software is written, sold, maintained, and improved in a technosocial and psychosocial context.

For this reason, every software leader should endeavour to find ways to keep both their individual teams as small as possible, as well as their total headcount. Interpersonal complexity increases combinatorially.

But this is in direct tension with the fact that creating a defensible business requires building a large system to address the essential complexity of a valuable (difficult) problem. If the problem could be solved by one or two people, it likely already would be.

#4 & 5: Implementation and Verification

Funnily enough, problems 4 and 5 are the easiest parts of creating a software solution, yet theyre the focus of most of the literature on creating software. Most books, articles, YouTube videos, and online courses are about implementation. How come? Probably because this part of the problem is easily repeatable, consistent, verifiable, and locally reproducible. I can run a Kubernetes cluster on my local, but not the customers mind.

That's not to say it isn't important. The people working on this part, the engineers, must have a thorough understanding of the fundamentals of their craft. The stronger their fundamentals, the more capable they are of reducing a problem to its essence and finding the simplest possible solution.

#6: Software Over Time

Problem 6 is the problem of our solution existing and changing over time. It is the fact that this whole process is recursive. Our software solution begets new problems, requiring us to either change our existing solution or implement new ones. This is where software engineering emerges out of programming over time.

Given that most of the complexity, and all of the value, of a software solution exists after it is delivered, this aspect is vitally important. Yet, humans tend to be bad at predictions and timescales. There is an enormous wealth of information addressing this problem from a technological perspective with engineering solutions, e.g. Site Reliability Engineering, but this problem also consists, perhaps equally, of an anthropological component.

Every technological component forms a feedback loop with the people working on it. The people working on it are in a feedback loop with their adjacent teams, the wider business, and the customers. The impact of supposedly technological choices such as branch protection rules or choosing an infrastructure-as-code (IaC) tool is mostly anthropological over a sufficient timescale. That branch protection rule will become your pull request review culture, which will become your team dynamics and hierarchy. That IaC tool choice might determine whether your teams become siloed.

Who Solves These Problems?

When we look at the approach of the people solving these problems, a maturity model emerges. One I see professionals grow through over time. This model consists of three archetypes.

First is the coder, who focuses on problem #4, and a little on #5. Their impact is largely out of their hands, they simply implement the solutions asked of them.

Next is the software engineer, who focuses on problems 4, 5, and 6. They also start thinking about problems 2 and 3, and sometimes problem 1. This person is increasing their impact over time, rather than optimising for the short term.

Finally, we have the product engineer. This person works to solve all six problems. Theyre not just leveraging their impact over time, but also in multiple social dimensions. They build trust within their organisation to improve the multidimensional fitness of their solutions by receiving candid feedback, as well as vulnerable expressions of the problems and concerns faced by their colleagues. They also know they need trust across the organisation because aspects of their solution will inevitably be incorrect. Without sufficient trust, they won't get the opportunity to improve their solution. Product engineers aren't just communicating within their group, but also with customers. They know the human context where their solution is used will determine its usefulness. Theyre deadly focused on understanding the problem.

Ultimately, the lesson in all of this is that anyone working in software needs to aspire to better understand people. To become a better communicator, to grow their empathy, to become a better person. The technology is just the tool. It's not enough to love our tools, or even the solutions they create. We need to love the people we're building software with, and the people we're building software for.

The Four Quadrants of Complexity

Anthony Manning-Franklin — Sun, 05 Nov 2023 02:15:36 GMT

Essential complexity is where software engineers are uniquely able to deliver business value, whereas accidental complexity is what some engineers think looks good on their resumes.

Another dimension, shown in the diagram below, is frequency. This ranges from frequently repeated complexities to once-off complexity.

Repeated complexities can be an area of the system that suffers a high rate of change, or it can be a cross-cutting concern that re-appears frequently throughout the codebase.

These two dimensions give rise to four quadrants:

Repeated accidental complexity: a time sink where engineering productivity goes to die.
Once-off accidental complexity: a necessary evil. You get through it and hope you never need to touch it again.
Repeated essential complexity: leaves people feeling good about wasting time. The hardest thing about repeated essential complexity is detecting it, but boredom or attrition of talented engineers might be a sign.
Once-off essential complexity: the holy grail of software engineering it delivers business value with minimal cost and great ROI.

What are the fixes?

For repeated accidental complexity, you either need to eliminate the complexity entirely (do you really need Kubernetes? No, really, do you?) or create an abstraction, API, or framework that transforms repeated tasks into once-off tasks. But be warned, this only shifts the complexity left. Natural entropic forces push complexity to the right. One day, it could blow up into a time-sink again.

For once-off accidental complexity, the fix is to leave it alone. Take engineers off it, prioritise and champion other work. Sometimes, the source of the entropic force pushing complexity to the right is your teams desire to improve things! But if it aint broke, dont fix it.

Repeated essential complexity first needs to be detected, then, like its accidental cousin, the correct abstraction, API, or framework needs to be created to shift left. But note that youre paying for this with what you hope is once-off accidental complexity. There are several traps here, such as excessive code reuse leading to inappropriate coupling that makes changes harder.

For once-off essential complexity, the fix is to focus on solving the problem in front of you. Dont overcomplicate it. Dont reach for new tools. Dont try to prematurely optimise.

In general, good tooling helps shift all complexity a little left. Whether thats paying for a proper IDE rather than VSCode, setting up linters, debuggers, or static analysis tools.

Platform teams are also another solution, but one that many companies reach for too soon. You need many engineers for accidental complexity to cost enough to give platform teams a positive ROI. Buy one too early, and they might start trying to fix stuff that aint broke.

Of course, non-functional requirements introduce a lot of complexity that looks accidental but turns out to be essential or vice versa. Much of technical leadership is helping teams distinguish the two.

Implementing the Outbox Pattern in Nodejs and Postgres

Anthony Manning-Franklin — Sun, 23 Apr 2023 09:20:37 GMT

As applications scale, infrequent problems become significant. A network failure for 0.1% of requests is trivial at 1,000 requests per day, but a nightmare for customer support at 1,000,000 requests per day. This commonly happens when we have external dependencies following a successful operation. For example, if we want to send an SMS to a customer after they book an appointment then we will need to:

Insert their booking information into our database
Send a HTTP request to our SMS service provider

Let's pretend that it is really important that users receive this message, and receive these appointment confirmation messages in the order they were booked.

If we do both actions when a customer sends a request to our appointment/book endpoint, what happens if one fails but the other succeeds? Do we really want to cancel appointments due to latency or temporary errors from an external service? Will we hold open our database transaction until the SMS provider confirms or denies our request? Of course not. We should avoid letting an external service interfere with our application's key value proposition. Holding open database transactions for external services will lead to connection pool exhaustion in our service, especially when the third party's response times increase. Yuck!

How can we ensure that a text message is sent, and retried on failure, after an appointment is booked? By using The Outbox Pattern. This pattern is especially useful when messages must be dispatched in the correct sequence, such as the payment processor example in the image below.

In this article, we will use the Outbox Pattern to ensure that an HTTP request to appointment/book only inserts a booking into our bookings table, while a background process sends an SMS message for each record in the table.

What are some of the implementation concerns our solution needs to address? They are:

Ensuring at least one message is sent per row.
Allowing this background process to run on multiple machines or processes without any risk of sending duplicate messages.
Ensuring messages are dispatched in the order appointments were booked.
Durability in terms of instance termination or process failure. I.e. messages aren't lost because a deployment occurred or a server crashed.

Let's get building!

Implementation

For our solution, we are going to use TypeScript, nodejs, pg, and Postgres with SKIP LOCKED. It will consist of:

A cursor repository that manages the transactions, locking, updating, and unlocking of a cursor
An outbox process that checks for new records to process
A handler that runs our action for each record

This approach allows us to create multiple outboxes or reuse the code for other purposes such as read model populators in an Event Sourced system.

The Cursor Repository

For our cursor repository, we want to allow callers to

Retrieve a cursor for an outbox if another process is not holding a lock on it
Update the cursor value
Unlock the cursor

We want the repository to support multiple processes, multiple cursors, and protect against a few common programmer mistakes. Let's start by defining the cursor table structure:

CREATE TABLE "cursor" (     "process_name" TEXT NOT NULL PRIMARY KEY,      "position" BIGINT NOT NULL DEFAULT 0,     "created_at" TIMESTAMP NOT NULL DEFAULT CLOCK_TIMESTAMP(),     "updated_at" TIMESTAMP);CREATE TRIGGER cursor_update_trigger  BEFORE UPDATE ON "cursor"  FOR EACH ROW  EXECUTE FUNCTION set_updated_at();

Let's insert a row for our SMS cursor

INSERT INTO "cursor"("process_name") VALUES ('booking_sms_outbox');

Now let's write our cursor repository. We're going to instantiate a single instance of a cursor repository per process. Our solution will ensure that we never have multiple instances, and that we do not reuse an instance across multiple cursors. This is important because our repository needs to manage transactions and locking, but we want to hide these details from the caller.

type ProcessName = 'booking_sms_outbox' | 'foobar_populator';type CursorRepository = {   getPositionWithLock(): Promise<number | undefined>;   setPosition(position: number): Promise<void>;   unlockCursor(): Promise<void>; };const CURSOR_REPO_MAP: Record<string, CursorRepository> = {};export async function getCursorRepository(name: ProcessName): Promise<CursorRepository> {   if (CURSOR_REPO_MAP[name]) {     return CURSOR_REPO_MAP[name];   }   const client = await connectionPool.connect();   let isLocked = false;   CURSOR_REPO_MAP[name] = {     async getPositionWithLock() {       await client.query(`BEGIN;`);       const { rows } = await client.query(`         SELECT position         FROM "cursor"         WHERE "process_name" = $1         LIMIT 1         FOR UPDATE SKIP LOCKED         `, [name]       );       const cursor = rows.shift();       if (!cursor) {         await client.query(`ROLLBACK;`);         isLocked = false;         return;       }       isLocked = true;       return Number(cursor.position);       // casting bigint to number means this will work until 2^53-1 which is 9,007,199,254,740,991 -- if we inserted 10 million records per day it would take more than 2 million years to exceed this number.     },     async setPosition(position) {       await client.query(`         UPDATE "cursor"         SET "position" = $1         WHERE "process_name" = $2        `, [position, name]       );     },     async unlockCursor() {       if (isLocked) {         await client.query(`COMMIT;`);         isLocked = false;       }     },     releaseClient() {        client.release();     }   };   return CURSOR_REPO_MAP[name]; }

We've used memoisation to implement a functional singleton. This means the caller can safely call getCursorRepository('booking_sms_outbox'); multiple times but only instantiate one instance of the repository.

Our approach also prevents the caller from mixing up process names between calls. Additionally, we ensure that all calls for a cursor are made using an exclusive database connection. This is critical for our transaction and locking behaviour, just be aware that each of these processes will consume one connection per instance.

Now let's create the background process that uses this cursor repository.

Pro-tip: Set a sensible value for idle_in_transaction_session_timeout in the connection config used by the cursor. This will help prevent zombie connections from holding the lock indefinitely. You will also want to ensure you have process lifecylce hooks to unlock any locked cursors in the event of process exceptions, SIGTERM, and SIGINT.

Background Processor

We want to be able to start a background process that retrieves a cursor, gets new records, and handles each record sequentially. We're going to build a generic processor creation function, and then in the final step, we tie it all together with our handler.

type HasId = { id: number };type Propsextends HasId> = {  processName: ProcessName;  retrieveRecords: (position: number) => Promise;  processRecord: (record: Record) => Promise<void>;}const CURSOR_POLLING_SLEEP_MS = 200;export async function initialiseCursorProcess<DbRecord extends HasId>(props: Props) {  const cursorRepo = await getCursorRepo(props.processName);  try {    const runTick = await createProcessTicker(props);    while (lifecycle.isOpen()) {      const numProcessed = await runTick();       if (numProcessed === 0) {       await wait(CURSOR_POLLING_SLEEP_MS);      }    }    cursorRepo.releaseClient();  } catch (error) {    logger.critical('Terminating server: cursor process error', {      processName: props.processName,      error,    }); // we use structured logging    cursorRepo.releaseClient();    await lifecycle.close(1);  }}async function createProcessTicker<DbRecord>({  processName,  retrieveRecords,  processRecord,}: Props) {  const cursorRepo = await getCursorRepository(processName);  lifecycle.on('close', async () => {    await cursorRepo.unlockCursor();  });  return async (): Promise<number> => {    const position = await cursorRepo.getPositionWithLock();    if (typeof position === 'undefined') {      return 0;    }    const records = await retrieveRecords(position);    let processedRecords = 0;    try {      for (const record of records) {        await processRecord(record);        await cursorRepo.setPosition(record.id);        processedRecords++;      }    } catch (error) {      logger.error('Could not process record', {        processName,        record,        error,      });    } finally {      await cursorRepo.unlockCursor();    }    return processedRecords;  }}

A lot is going on here so let's break it down.

  const runTick = await createProcessTicker(props);  while (lifecycle.isOpen()) {    const numProcessed = await runTick();     if (numProcessed === 0) {       await wait(100);    }  }  cursorRepo.releaseClient();

First, we pass in our props and create the function that we want to run every tick. Then we start an infinite loop that will run until the node process receives a SIGTERM thanks to lifecycle.isOpen() using a lifecycle manager.

If nothing happened during each tick, we want to wait 100ms before running again. This prevents the process from flooding the nodejs event loop with callbacks in every tick. (wait is simply export const wait = async (ms: number) => new Promise(resolve => setTimeout(ms, resolve);).

After the while loop we call cursorRepo.releaseClient(); to close the connection to the database. This is required for await pool.end() to resolve during server shutdown.

What about that error handling?

  } catch (error) {    logger.critical('Terminating server: cursor process error', {      processName: props.processName,      error,    });     cursorRepo.releaseClient();    await lifecycle.close(1);  }

In this case, something pretty drastic has happened, we want to log our errors and shut down the server as quickly as possible.

Moving on to the meat of our implementation, the process ticker and its creator function.

async function createProcessTicker<DbRecord>({  processName,  retrieveRecords,  processRecord,}: Props) {  const cursorRepo = await getCursorRepository(processName);  lifecycle.on('close', async () => {    await cursorRepo.unlockCursor();  });  return async (): Promise<number> => {    // trimmed

In our createProcessTicker function we retrieve our cursor repo and include it in the closure space of our tick function that we define and return on the next line. I've also registered an onClose handler with our lifecycle manager to ensure any cursors release their locks before node exits.

Let's look at what happens in each tick.

    const position = await cursorRepo.getPositionWithLock();    if (typeof position === 'undefined') {      return 0;    }

If we cannot retrieve a position it means another nodejs process, perhaps on another server, has a lock on this cursor at present. In that case, we want to return 0 since we won't be processing any records and wait at least 100ms before trying again.

    const records = await retrieveRecords(position);    let processedRecords = 0;    try {      for (const record of records) {        await processRecord(record);        await cursorRepo.setPosition(record.id);        processedRecords++;      }    } catch (error) {      logger.error('Could not process record', {        processName,        record,        error,      });     } finally {      await cursorRepo.unlockCursor();    }    return processedRecords;

Now that we have some records, we simply initialise our counter then attempt to process each one and update our cursor position each time. We must make sure we always unlock the cursor when finished.

In the next tick, our process will re-attempt to process the record. This is handy if it failed due to reasons such as network error, but we will want to trigger alerts if we receive too many 'Could not process record' errors in a given period. It's also a good idea to measure how far behind the latest record each cursor is.

Time to bring this all together for our appointment booking SMS dispatcher.

Appointment Booking SMS Dispatcher

All of our process scaffolding is in place, now to build our SMS dispatcher.

export function initBookingSmsDispatcher() {  initialiseCursorProcess({    processName: 'booking_sms_outbox',    retrieveRecords: bookingRepo.getNewBookingsSinceId,    processRecord: async (booking: Booking) => {      await smsProvider.sendBookingConfirmation({ booking });    },  });}

The observant amongst you will notice the unawaited promise in initBookingSmsDispatcher. This is because we would be awaiting a while loop that only closes when the server is terminating. Accidentally awaiting a call to initialiseCursorProcess can cause the entire server to hang. So not only does initBookingSmsDispatcher nicely encapsulate all of our requirements for our SMS Dispatcher, but it also prevents us from accidentally deploying a goofy change failure bug.

For retrieveRecords we supply bookingRepo.getNewBookingsSinceId which is a repository function to retrieve all new rows since a given id. It might look something like this:

const INSERT_LATENCY_MS = 5;async getNewBookingsSinceId(id: number): Promise {  const isZero = id === 0;  const queryText = isZero ? `    SELECT * FROM "bookings" ORDER BY "created_at" ASC` : `    SELECT * FROM "bookings"    WHERE "created_at" > (      SELECT "created_at" FROM "bookings"      WHERE "id" = $1      LIMIT 1    ) AND "created_at" < (NOW() - INTERVAL '${INSERT_LATENCY_MS} milliseconds')    ORDER BY "created_at" ASC   `;  const { rows } = await client.query(    queryText,     isZero ? undefined : [ id ]  );  return rows.map(bookingRowToBooking);}

This prevents the skipping of records that were inserted out of order. There is a race condition in Postgres where sequence numbers are granted to transactions concurrently. This means row 1234 might be inserted after row 1235. Looking up by created_at means we still get row 1234 if we look for events created after 1235.

We also avoid looking up records inserted within the last 5 milliseconds using AND "created_at" < (NOW() - INTERVAL '${INSERT_LATENCY_MS} milliseconds'). The reason for this is that when inserting many rows rapidly, there is no way to guarantee that each row will be available for lookup in the same order as their created_at timestamp.

For example, if I rapidly insert rows A, B, C, D, E, F and perform this lookup at the same time, Postgres could return rows A, C, F. But a moment later, the same query could return rows A, B, C, D, E, F. Why? Because rows B, D, and E may have a larger payload that takes longer to write to disk. Without this minor 5 millisecond latency, the cursor would jump to position F, and rows B, D, and E would never be processed.

Why 5 milliseconds? It depends on the size of your payloads. At SKUTOPIA, our largest payload is <20 kB, and we expect 5ms to work reliably up to ~2,000kB. For us, this is a very acceptable trade off.

Pro-tip: Consider a BRIN index for your created_at column, depending on the size of your table. Also consider using CLOCK_TIMESTAMP instead of NOW for your created_at column, in case the application ever inserts multiple rows in one statement or transaction.

When to Avoid the Outbox Pattern

The Outbox Pattern is not a replacement for a job system. This outbox process will halt if it encounters an unprocessable message. That's a design feature for some high-integrity use cases, but a flaw in others. It also trades off throughput for guaranteed ordering. If you need to send a lot of messages concurrently as fast as possible with no consequence in terms of failure, then this pattern is a poor fit.

For this article's SMS example, I would personally use a job system like pg-boss, and push back on our fictional PM's request for sequential SMS messages. Customer communication rarely needs an outbox implementation, it's typically programmatic systems that have trouble receiving messages out of order.

Conclusion

Now that you have the foundations for any cursor-based process to progress through a series of database records, you can use it for any outboxes you might want to create. It also works fantastically with Event Sourced systems, which is where we use it at SKUTOPIA. Each of our domain services uses event sourcing for their application state and publishes events to a PubSub topic for communication amongst services. We also use it to ensure usage events are sent to our payment processor. A critical use-case where at least once delivery and message ordering are both required.

Where might you use an outbox? Let me know in the comments.

Tech’s Surprisingly Bad Math in Team Design

Anthony Manning-Franklin — Sun, 27 Nov 2022 12:33:28 GMT

How often have you seen this happen: The business is anxious that the product team is not kicking enough goals. They want better business outcomes faster, so they hire more software engineers.

Makes sense at first glance, right? Engineers write the code. Theyre the final step in value creation for the business. But is that really where the companys spend has the most leverage? We want to maximise our return on investment, and as were about to see, hiring more engineers is one of the most expensive ways to achieve that. Surprisingly, hiring more engineers can actually cause revenue to drop compared to staying the course with the current headcount!

A primer on value chains

To understand the following analysis, we need to understand value chains. A value chain is every step involved in creating a businesss product. In a manufacturing business, this might be

acquiring raw materials,
shipping them to a refinery,
refining them,
shipping the refined materials to a manufacturing plant, and then
manufacturing the final goods.

Each step in this process takes its inputs from the previous step, performs some operation on them, and produces outputs that will be the inputs for the following step. Often, upstream improvements in efficiency have a multiplicative effect on the outputs downstream.

So what is the value chain in a software company? Its roughly

Research: Identifying valuable problems to solve
Designing a feasible, viable, valuable, and usable solution
Experimenting to de-risk that solution: Usability testing, tech spikes, wizard-of-oz tests, fake door tests.
Creating the solution via code

Theres a much simpler way I like to think about this: Every solution you build is a gamble; engineers let you place more bets, and product and design help you choose the bets with the greatest odds of winning.

The Software Value Chain

Lets explore each of these steps. Every company does them to some extent, but the quality of execution at each step is not equally distributed amongst software product companies. In companies without a strong product culture, the early parts of the software value chain happen in a haphazard design-by-committee way from various stakeholders.

Research: Identifying valuable problems to solve

This is the responsibility of Product Managers. Many companies mistake Product Managers for Project Managers. They wind up paying lots of money for someone to spend their time writing tickets and micromanaging smart people. That is not the job of a Product Manager. Their primary responsibility is identifying problems worth solving and then de-risking those solutions.

That de-risking aspect is critical. Solutions are incredibly expensive to implement, and we have a limited number of bets we can place per year on features we hope will have a material impact on the business. But Product Managers cannot and should not do it alone. This brings us to the next step.

Designing a feasible, viable, valuable, and usable solution

Excellent product design doesnt happen in a vacuum, and it isnt a one-and-done process. If you think design is about aesthetics, you have completely misunderstood design. The goal of design is to marry the behaviour of the solution with the customers mental models of the problem, the task, and your application. Aesthetics are one tiny component of that incredibly complicated process. During the design phase, designers and product managers work together to address the four risks:

Value risk: Will the solution solve the problem in a way that provides adequate value for the customers and the business?
Viability risk: Can the business actually execute on this solution? Is the solution defensible and profitable? Do stakeholders in legal, marketing, customer support, and other functional departments have any objections to the solution?
Usability risk: Will a large enough segment of our customer base be able to understand and use this feature well enough to unlock value?
Feasibility risk: Are we sufficiently confident that engineering will be able to complete the feature in an acceptable timeframe? Will we be able to meet our non-functional requirements, such as performance?

To address these risks, we can use several discovery techniques, such as usability testing, empathy interviews, contextual enquiries, prototyping, tech spikes and proof of concepts, as well as working with stakeholders. The more experiments we can run, the more iterations design can create, the higher our confidence that the solution will provide value, AND the more value that solution will provide to customers and the business.

The critical thing to understand here is that every problem has multiple solutions, each of which has many variations itself. It is 100x cheaper to iterate through these solutions in the design phase than in the implementation phase.

Delivering the solution

The solution has been de-risked through several design and experiment cycles until we were confident it would achieve our desired business outcomes. Now, were ready to build it. Building and releasing software is hard. This is the most expensive and complicated part of the value chain.

Optimising The Software Value Chain

Now that we have thought about the entire value chain, lets think about optimising it. We cant do that without first considering how much we spend on each phase. For the sake of simplicity, Im going to say every person gets paid $150,000 per annum, and I will only count the base salary cost. If we have

1x Product Manager
1x Product Designer
5x Software Engineers

We are spending

$150,000 on Product Management
$150,000 on Product Design
$750,000 on Software Engineering

Which is $1,050,000 in total.

What does this spending get us? Spending on Product Management and Product Design increases our confidence in our solutions. Spending on Software Engineering increases the number of solutions we can ship each year. Or, to put it another way, spending on Engineering increases the number of bets we can place, and spending on Product increases the likelihood those bets will succeed.

However, there is a critical second-order effect here. As the software engineer-to-designer ratio tips further towards engineering, the workload on Product & Design increases, decreasing their ability to contribute to the solution's success. That is, spending more on engineering can actually cost you revenue due to a reduction in winning bets.

This is a lose-lose outcome for the business. They spend more to make less revenue. If you double the number of engineers, you dont get twice as many bets per year because diminishing returns eat productivity pretty hard. But you do create twice as much work for Product, and double the salary spent on engineering, which already accounted for over 70% of expenditure.

Why does this happen? Simply, the Product & Design team have less time per feature for experimenting and iterating. Fewer tests and fewer iterations mean a lower chance of success.

Unfortunately, most companies dont make time for Product & Design to do their discovery work at all. They simply build their best guess, which can work if you have plenty of cash to burn finding out what sticks, but in this climate, an ill-advised strategy.

So, what can we do to maximise our successful bets for the least expense? To maximise the companys profit?

Well, we cant increase the number of Product Managers per squad. That centralisation of context and stakeholder comms is critical to the role's success. Instead, we can double the number of designers. This is approximately only a 15% increase in expenditure, but it will almost double the capacity for discovery and research.

What impact does that have on profitability? Well, I built a spreadsheet to model this system! It shows that a 15% increase in spending results in over 50% more winning bets. It makes sense; youve effectively doubled the quality of the inputs going into engineering for a fraction of the price of the whole value chain.

Feel free to copy my spreadsheet and play with the numbers yourself. If youre a weirdo like me, its actually a lot of fun.

Conclusion

It seems insane to me that the tech industry has been working on the assumption of one designer per squad for almost 15 years without questioning it. Yet, nearly every designer in tech I speak to is burned out and scrambling to get designs ready for engineering, let alone having time to run proper usability testing and discovery. Instead, the solution most tech companies choose seems to be to ask engineers to do more, and hire more of them. Not only is it addressing the wrong part of the value chain, but it makes the problem worse!

This also comes back to the theory of constraints and bottlenecks. It looks as though engineering is the bottleneck because that is where the most activity occurs. But if you change how you measure the system, from the number of bets placed to the number of bets won, then the bottleneck shifts upstream in the value chain. The bottleneck is product and design.

If we address this problem as an industry, not only might we make much better products and regain some credibility after a glutenous frenzy on VC funding that has come crashing down. We might also make tech companies a great place for product designers to work, reduce the pressure on engineers, and save people from needless burnout and the health risks that come with it.

Belonging & Psychological Safety in Remote Teams

Anthony Manning-Franklin — Mon, 31 Oct 2022 12:13:43 GMT

Much of the existing literature regarding team culture assumes a co-located team. Its not that hybrid or remote teams didnt exist before the pandemic, but they certainly werent the norm. Existing studies consider facets such as desk position, building architecture, meeting rooms, seating arrangement, body language, physical contact, and team outings. How do we make the existing research relevant for a remote team?

When I joined SKUTOPIA in the middle of the pandemic as a fully remote Tech Lead, I had to figure out how to lead a team on the opposite side of Australia, over 3,000 km away. As the first remote team member, I began growing our Engineering department from 3 to 14. Along the way, I also added Product, UX, Data, and IT departments. Distributed across six cities and three countries, figuring out how to foster psychological safety and belonging on a remote team was not optional. It was do or die.

Culture: Safety and Belonging

Lets take a minute to define the most important parts of team culture. Just two things are the primary predictors of both team success and staff retention. It isnt average intelligence or training; it isnt free food, foosball machines or ping-pong tables. Its whether psychological safety and a sense of belonging are prevalent among the group. Thats it.

You will need more than those two things to succeed, but success is not probable without them.

What is psychological safety, and why is it so critical to performance? Psychological safety is whether you consider your social environment to be a hostile one or a safe one. In the workplace, this means we feel like we wont be punished for speaking up, voicing concerns, suggesting ideas, making mistakes, or asking questions.

Similarly, a sense of belonging is the feeling that we are wanted and accepted in our group, now and into the future. That we can contribute to the group, be recognised for that contribution, and thus maintain our place in the group.

In short, you cant have psychological safety without belonging, and you rarely find belonging without psychological safety. You must have both for your team to succeed.

Our brains are wired to live in groups; we die on our own. Our amygdala is constantly scanning our social interactions for signs of a threat to our status in the group. If we perceive a threat to our status, our prefrontal cortex is hijacked by the amygdala while our entire focus becomes reaffirming our status.

I can tell you from experience, a perceived threat to your continued place in a group sucks shiitake (thats the totally scientific term for it). Ironically, this undermines our ability to perform, and in a modern workplace, further weakens our status in the group. When groups cannot offer their members psychological safety, their culture becomes pathological, meaning:

People cease cooperating (someone elses success is a threat to your position)
Those who deliver bad news are punished for it
People attempt to do as little as needed to remain a small target: Its not my responsibility!
Communication and cooperation between groups ceases, and territorialism takes its place
Failures lead to scapegoating and punishment rather than inquiry and learning
New and novel ideas are rejected, and innovation stops completely (too risky)

The bad news is that psychological safety and belonging are both temporary. It requires a constant stream of affirming signals, not just an absence of threats. What serves as an affirming signal, then? In short, anything that fosters a sense of belonging. But what are the ingredients of belonging?

First, is that our role in the group is likely to continue, called future orientation. These signals indicate that our inclusion in the group is expected to continue into the future. This is why career development plans can help increase retention, but they are a rather clunky belonging signal. We will consider others later in this article.

Second is energy. These signals occur when people show enthusiasm for us. If youve ever had a boss who avoids one on ones or checks his phone during them, youve unfortunately experienced the absence of energy.

Third is individualisation, which occurs when we are treated as unique and individual. This is commonly referred to as feeling seen. It happens when people have bothered to notice traits that are our own. It could be something as small as your boss asking about your kids dance performance over the weekend and remembering their name.

What about the affirming signals for psychological safety?

First is vulnerability. A signal that were in a safe environment is that others are comfortable being vulnerable. This means we see people freely admitting their mistakes and being supported rather than humiliated or punished. For leaders, it is especially important to model vulnerability.

Second is embracing bad news. In a safe environment, bad news flows freely. People dont have quiet conversations about how to avoid bringing bad news to their boss; they go to their leaders for support.

Third is that everyone has a voice. In a safe environment, senior people arent expected to talk more than junior people. There arent rules governing who can ask questions or offer suggestions.

With all of these signals, frequency matters more than intensity.

Strong Signals in a Remote Workplace

Creating these signals in a remote environment is challenging but not impossible. It requires a little more intentionality; things that may have been intuitive in the office require conscious effort in a remote team. Although, for some of us neurodivergent folk, none of this was ever intuitive. Learning to consciously build the habit of using a particular communication style is a process we are very familiar with.

Lets run through some of these habits and considerations that foster a great remote culture.

Speedy handovers: Leave your mic on

The human brain is wired for conversation. It is one of our most extraordinary evolutionary feats, and we take it for granted every day. While turn-taking time between cultures ranges from 200ms to 1,000ms, the processing power our brain devotes to the art of gracefully coordinating conversations is immense.

When we switch speaking roles in under a second, this includes the time it takes our brain to prepare our vocal muscles, change our expression, and begin to speak. Before that, it has to begin planning what we are going to say, which means it has already parsed and processed not only what the other person is saying and what it means but also what the other person is likely to say.

This means our brain has already predicted when the conversational handover will occur long before the person speaking finishes their sentence. As this happens, we begin giving the speaker cues that we are about to take a turn speaking. This can be small sub-vocalisations such as ah or eh (in all cultures, these are open-mouthed vowels as these take the least effort to form), but we also use facial expressions and body language.

Now consider this in the context of a video call. The participant cannot see all of your body language. The signals they do receive are delayed 300 to 800 milliseconds. If your microphone is muted, they miss the sub-vocal signals. Plus, without conscious effort, you will probably unmute your microphone when you should already be talking, further delaying handover.

All up, were adding about 2,000 milliseconds to a process that usually takes 200 milliseconds. Yikes! This is one of the reasons it can feel so exhausting to be in video calls all day.

We can do a few things to improve turn-taking in a video call, but the most effective one is the simplest: leave your microphone on.

There are typically two reasons we mute our microphones, either because were doing something else, like checking Slack, and dont want people to hear us, or because were not using headphones and our speakers are causing some feedback.

Both of these issues are easy to solve. First, if youre going to be in a video call, either be present or get the fluck out. Im prone to distraction myself, so I make my video calls full-screen and mute notifications.

Secondly, if you will be making video calls from your workstation a lot, then invest in good equipment. Buy a decent pair of headphones or earphones and a microphone. Trust me, being understood is worth the effort.

And finally, if the acoustics in your home office are problematic, buy some acoustic treatment or add some large soft furniture. Acoustic treatment is about mass, airflow resistivity, and positioning. The goal is to absorb air particle movement at the point where particle movement is greatest and turn it into friction. This means you want to place acoustic treatment at a one-quarter wavelength from the wall at your lowest target frequency. For speaking, this is 1 kHz, so you want your acoustic treatment at least 8 cm (3 inches) off the wall if possible. I use acoustic insulation mounted in timber frames and covered in cloth, with spacers behind the frame.

Lastly, you want your active listening body language to work effectively in a video call. This means ensuring your camera is placed on the screen with the other participants. Theres nothing quite so disconcerting as talking to someone who looks away when youre speaking because their camera is beside them. I, personally, try to keep the video as close to the top and centre of the screen as possible so that it is as close to eye contact as possible during a video call.

And it almost goes without saying, but include as much of your upper torso in the frame, from an eye-level camera, with good lighting. Ive never felt terribly connected to disembodied eyebrows or looking up a hairy nostril.

Leaders, Highlight Your Fallibility

This is relevant in co-located and remote teams, but remote teams need a little more consideration regarding who has witnessed your fallibility. Showing vulnerability is the first step to forming trust; psychological safety cannot exist without trust. For leaders, this means highlighting your mistakes and making others aware of them when they otherwise wouldnt be. This can be as simple as a Slack message in a team channel announcing that you made a wrong decision on X, and thanks to feedback from person Y, you are now going to do Z. I cannot overstate the effectiveness of broadcasting your failures publicly.

Embrace The Messenger

Getting bad news flowing in a co-located team is hard, and bad news is critical to success. Its even harder in a remote team but simpler in other ways. The key is to set up multiple streams of bad news in different formats. A mixture of pull and push, weekly, fortnightly, bi-quarterly, one-on-one and group, anonymous and identifiable. Each of these different formats brings forth different types of bad news from different people. Ill run you through each practice I use.

One on Ones

fortnightly, pull, individual, identifiable

This is the most common approach to seeking out bad news. Make sure you specifically ask your direct reports about the challenges they face, their concerns, the things they have learned, their opinions of their teammates, and their suggestions for improving the team. The people you need to listen to most are the ones who are most reluctant to bring up issues, especially regarding other team members. You need to give them a non-judgmental way of speaking about these topics with open-ended questions.

Retrospectives

fortnightly, pull, group, identifiable

This is almost the group equivalent of a one-on-one. While we dont use Scrum anymore, we kept the fortnightly retrospectives because it is great for the team to discuss their issues and concerns. I try to stay quiet in these meetings. Our job as a leader in a retro isnt to solve problems. It is to understand them. The team will come up with actions to solve the short-term problems. You need to pay attention to the systemic causes. Solving these issues goes beyond the retro.

My rule of thumb during a retro for anyone in a leadership position is that they must ask at least two questions before offering a solution for a given topic. And be very cautious offering any solution that isnt a concrete action anyone on the team could accomplish before the next retro. Ive seen too many aspiring leaders completely undermine themselves in retros by offering a so-called solution that really was a suggestion that the complainant themselves are the problem, and they should adjust their expectations. Not good, at all.

(If someone really does need to adjust their attitude, the retro is not the place for that conversation.)

Keep an eye on who is contributing the most to a retro, and who isnt. Its worth explicitly asking for input from quieter participants from time to time. One cause of an imbalanced retro is that the most frequent speaker is actually speaking on behalf of others. A large imbalance implies there is a trust issue in your team. One person is exceptionally safe, and quite likely, one person in particular is making other people feel threatened. People are afraid to speak up in front of the threat, so they channel their concerns through the safe person. This is a big red flag for an imminent issue that could otherwise fly under the radar on a remote team. Start investigating!

Team Health-Checks

bi-quarterly, pull, group, identifiable

Every six weeks, each of our squads runs a health check session. The metrics we use for the health checks are factors that the team decided were critical to their success. It is their opportunity to hold leadership (me) accountable for giving them what they need. I reiterate that at the start of every session, and then shut my mouth, letting someone else facilitate the session.

We also make a point that the score we record for each factor is the lowest score, not the mean or mode. Were a team, and that means that any one person suffering is a loss to all of us. Were all responsible for helping each other; we cant let our teammates set themselves alight to keep the others warm.

Anonymous Surveys

ad-hoc, pull, group and one-on-one, anonymous

I will regularly put together anonymous surveys on hot topics for the team. This might include communication preferences, tooling state, psychological safety, etc. Anonymous surveys let the team be more candid than they otherwise might be. Be careful; if your anonymous surveys are a lot more negative than your retros or one on ones would suggest, it is likely that your team dont feel safe speaking their minds.

If your team uses Google Workspaces, these are easy to run using Google Forms with a single response per person while maintaining anonymity.

I refer to surveys as a group and one-on-one channel because the quantitative data is group, while the qualitative data can be individualised. Include a mix of both in your surveys, through Likert-style questions and open-ended questions.

The Insider

bi-quarterly, pull, one on one, anonymous

I will regularly have a peer on the team conduct one-on-ones with the team members themselves. By speaking among equals rather than with their direct manager, they may feel safer raising other issues. This can be especially true regarding interpersonal conflicts. The time to resolve interpersonal conflicts is as soon as possible, but people generally want to avoid conflict and are particularly averse to going to their manager with petty issues.

I will then have my insider report back to me with the general zeitgeist of the team. This is about discovering the issues, not who said what. Confidentiality is essential for the teams confidence and the ongoing success of the insider.

Guild Slack Channels

continuous, push, group, identifiable

We have guild channels for engineering, product, and design. In many instances, people will raise concerns they have about process, tooling, or workflow. We also have team channels. The hard part is getting the ball rolling with suggestions. If the team doesnt have people who feel comfortable doing this already, then you need to ask questions and solicit feedback. After doing this enough times, people will begin raising their questions, concerns, and ideas.

The Missing Element

Astute readers will notice a missing combination: continuous, push, individual, and anonymous. This would be the digital equivalent of an anonymous suggestions box. I havent personally implemented this combination myself, but it could be feasible with a repeatable anonymous form and a regular Slackbot reminder to solicit contributions. If you try it, let me know how it goes!

Now, thats enough on the topic of soliciting bad news. Lets get back to our contributing factors for a healthy remote culture.

Over-the-top Gratitude

Showing gratitude and appreciation on a remote team can be as varied and fun as it is with a co-located team. But as with anything remote, it requires a little more conscious effort.

Everyone has different ways they like to receive praise. I ask new direct reports if they are comfortable receiving praise in public. Occasionally, someone will prefer not to be called out in front of the company for their awesome achievements!

There are a few things to consider when giving praise remotely, they are:

Medium
Tone
Audience

The mediums being chat, email, video call, voice call, pre-recorded or live. Tone is a question of formal or informal, and audience is, well, the audience.

For example, celebrating someone's success in the company-wide announcements channel will need a different approach than in a video call with their squad, and so on. The guidelines roughly work out like this:

The faster the medium, the less formal the tone Slack is less formal than email
The larger the audience, the more formal the tone A Slack message in the company-wide channel might be equally formal as an email to the squad.

Theres another underlying factor, but company-wide praise of your direct reports can seem like a political game of optics rather than genuine praise. Context and tone are essential here. And remember earlier when I said individualisation was a signal for belonging? Its harder to individualise your message when sending it to a wider audience.

For this reason, my favourite way to celebrate peoples success is in a Slack channel with their primary team. This allows me to go all out, using some ALL-CAPS, emoji bombs 🎉🤩🕺, an excessive number of exclamation marks (!!!), and some insider references or jokes.

This might sound over the top, too informal, too vulnerable. That is why it works. Text communication needs the extra oomph, but also, when youre celebrating success, you need to be a person, not a boss. Done right, its a key moment for bonding the whole team because it creates a moment of shared pride.

I cannot stress enough how important it is for the people observing you giving praise to someone else, to feel joy themselves; To empathetically feel and share that sense of pride.

Imagine going to someones graduation ceremony. Is the healthy response beaming with pride, or seething with jealousy? Healthy teams experience the former when celebrating each others success. So many of these rituals we have as humans arent about celebrating the individuals success as much as they are for the group to bond over a sense of shared pride.

In a healthy remote culture, this means your celebratory message should receive many emoji reactions and enthusiastic replies. A dry, formal congratulatory email is not likely to elicit a positive emotional response from its audience. But an over-the-top, emotion-laden, emoji-bombed message has a certain infectious joy!

As important as that asynchronous Slack message is, you need to back it up with another congratulatory moment in a video call with the team. This is critical; we need both the written form and the body language of a video call. Neither on their own is quite enough in a remote team. I also back it up with more congratulatory praise in our next one-on-one. Theres no such thing as too much gratitude.

If you can, sending gifts can also help your team seem real and connected. This could be tasty treats, plushies, vouchers, books (not work-related!), or whatever you think will be appreciated. If youre lucky, the team will adopt this practice themselves, sending each other gifts to show appreciation. This hits a belonging double-whammy of energy AND individualisation.

As important as big moments of recognition are, it's important to have plenty of little moments of recognition too. I recommend setting up a #high-fives Slack channel where anyone can publicly show gratitude for people who look out for their teammates, whether small or big.

Manufacturing Collisions

Co-located workplaces lead to many accidental collisions. A collision is a random encounter that winds up being beneficial in terms of either information sharing, belonging, or psychological safety. It can be a conversation in the elevator, grabbing a coffee, or overhearing a conversation near your desk.

In a remote team, accidental collisions happen almost exclusively in chat. The rest we need to consciously plan for. Ive tried each of the following ideas to varying degrees of success with different teams.

Coffee-run Slack Huddles

The morning coffee run can be a great bonding experience in the office, but what is its equivalent on a remote team? One method Ive tried is a regular slack huddle. It has had varying success, but I think my approach with this technique fell into a negative network effect. I recommend trying with small groups in the same time zone. Even better if you can actually consume a hot beverage at the same time.

Donut: Random one-on-ones

Donut is a Slack app that pairs everyone up together randomly every week or so for a virtual coffee. Some people will love having time to get to know their colleagues and talk about things other than work. Others will loathe it.

Gather: Avatar Proximity Video Chat

Apps like Gather aim to mimic the spatial element of real-world conversations. Everyone has an avatar in a 2D world, but you can only hear and see the people standing near you. This allows people to drop in and out of conversations, break into groups, and swing by your desk.

Ive used this for two purposes: mimicking the office and socialising. It works to varying degrees for either purpose, but again, it resonates with some people but not others.

Standups: A Double-Edged Sword

We dropped stand-ups in favour of asynchronous status updates. Then we returned to doing them three times a week in groups of less than six people. The larger the group, the less each person talks relative to the length of the meeting. On a video call, this is even worse.

We returned to stand-ups, not for productivity, but because we felt socially unmoored. It was possible to go an entire week without seeing the face of one of your squad mates.

Generally, I would say never use meetings for status updates, but this is one exception. Just keep the number of attendees low and the meeting very short! Anything bigger will not scale with the team.

Pick Up The (digital) Trash

Remote teams still have necessary but mundane tasks. As a leader, do them! After we split our Jira board into two projects, we wound up with both teams tickets in the new board, and all past and present tickets were reset to to-do and turned into tasks! I reviewed nearly 500 tickets individually and deleted roughly 400 of them. Just like cleaning up around the office, it shows that no one is above caring for the team and doing what needs to be done.

Leverage Threshold Moments

Onboarding on a remote team can be even more daunting than in person in some ways. That means you need to perfect every single element of the experience. If you have flashy promotional videos, include them in your onboarding docs. Set up plenty of one-on-ones with different team members. Dont be afraid to use over-the-top language and raise the emotional intensity a little, be cheesy and fun, with some self-awareness of your awkwardness.

This is also an excellent moment for connecting their role to a sense of purpose. It can be challenging, but you must connect their work to the companys mission. In a physical office, you can cover your walls with purpose and symbolism. In a remote team, you need videos, messages, and other constant reminders, rather than relying on ambient indicators of purpose.

Bring together groups of teammates for get-to-know-you video calls instead of team lunches. Prepare some icebreakers; the more awkward they are, the better. I find five people is the most you can have on a call before turn taking breaks down and people begin to disengage. You have to work harder to turn the abstract concept of the remote team into a concrete experience, but it is definitely achievable.

Bringing It All Together

No two teams are the same. In fact, someone once said to me, teams are immutable, meaning that every time someone leaves or joins a team, its really a whole new team. While analogies are never entirely true, this one holds some truth.

What has or hasnt worked for me may work for you and your team. It might even work for my team at some point in the future when it hasnt in the past. These ideas are worth trying at least once, if not twice.

That said, the fundamentals never change. Belonging will always require a steady stream of future orientation, energy, and individualisation. Psychological safety will always require vulnerability, open discussion of bad news, and everyones voices being heard.

I hope this article has given you some food for thought, and as always, I would love to hear what has worked for you in the comments. What are your techniques for fostering psychological safety and belonging in remote teams?

Measuring Apdex from access logs in SumoLogic

Anthony Manning-Franklin — Thu, 18 Aug 2022 09:30:09 GMT

Application Performance Index (Apdex) is a standardised method for calculating the perceived satisfaction of a user accessing your service. It divides all served requests into three categories: satisfied, tolerating, and frustrated.

A user's request is said to be satisfied when it occurs within some T value, such as 400ms, and is successful, e.g. 2xx or 3xx status codes.

A tolerating request is successful in more than T, and less than 4T.

Frustrated requests exceed 4T or fail, e.g. 4xx and 5xx status codes.

So how can we build this measure in SumoLogic? Let's take a look

| json auto field=raw_log| if(statusCode matches "2*", if(responseTime <= {{Apdex_Time}}, 1, 0), 0) as satisfied_counter| if(statusCode matches "2*", if(responseTime < {{Apdex_Time}} * 4 && responseTime > {{Apdex_Time}}, 1, 0), 0) as tolerating_counter| timeslice 150 buckets| count as total_logs, sum(satisfied_counter) as satisfied, sum(tolerating_counter) as tolerating by _timeslice| ((satisfied+tolerating/2)/total_logs)as apdex| fields apdex, _timeslice

We use structured logging, so our logs are JSON formatted, but you could do this just as easily via a regex capture on apache style access logs to extract the status code and response time.

This simply creates a counter for satisfied and tolerating using nested if functions with the matches operator. The frustrated queries are everything not captured by these two counters, so count as total_logs gives us everything else we need, assuming our log source only contains access logs.

And that's it! You can even overlay the percentage tolerating, frustrated, and satisfied if you like:

| ((satisfied+tolerating/2)/total_logs)as apdex| satisfied/total_logs as satisfied_pct| tolerating/total_logs as tolerating_pct| (total_logs - satisfied - tolerating)/total_logs as frustrated_pct| fields apdex, _timeslice, satisfied_pct, tolerating_pct, frustrated_pct

Applying Google's Testing Methodology to Functional Domain-Driven Design For Scalable Testing

Anthony Manning-Franklin — Sun, 08 May 2022 10:30:08 GMT

Recently I wrote an article about applying Functional Programming to Domain-Driven Design. One of the key benefits of that approach is improved testability, but we didn't get to delve into it too deeply.

In this article, we will consider what factors make an automated test suite great. We will bring together a lot of ideas from Software Engineering at Google. Whenever I refer to "Google" in this article, I am referring to the authors' depiction of Google's engineering practices in the book.

I will also translate some of Google's testing methodology to TypeScript. We will then see how it fits with Functional Domain-Driven Design (fDDD).

Good Test, Bad Test

Let's start by defining what a good or bad test suite is.

There are five dimensions we can use to consider the quality of our tests. These are brittleness, flakiness, speed, and readability. In the same sense that optimising outside the bottleneck is wasteful, the metric a team needs to focus on improving is whichever one is worst at any given time.

Brittleness (Durability)

A test is considered brittle when it breaks due to unrelated changes. If you have ever changed a small piece of code as part of a (supposedly) simple ticket and wound up failing scores of seemingly unrelated test cases, you have experienced the frustration of brittle tests.

Good tests should be updated less frequently, while bad tests sap productivity with needless changes. Measuring how often tests are changed can be an effective way of finding brittle tests.

Flakiness (Reliability)

A flaky test is a non-deterministic test. That is to say; sometimes it fails despite neither the code nor the test changing. When tests are flaky, our continuous integration and deployment pipelines suffer from unnecessary re-runs, our lead time for changes grows unnecessarily, and worst of all, engineers pay less attention to the tests.

We can measure flakiness in the number of times we can run a test suite and have it return the same result. That is to say that a test suite that, if I ran it twice, failed once and succeeded once, its flakiness would be 100%. The equation for flakiness percentage is failures / successes

Speed

Tests work best when they can provide real-time feedback as part of an engineer's workflow; The best tests can run in the IDE while the engineer is coding. However, as we will see later, some tests must sacrifice speed to interrogate the subject under test.

Fast tests improve the local development experience, increase the utilisation of tests amongst engineering teams, and improve the lead time to deployment.

Readability

Ultimately our tests need to document expected behaviour for other engineers to read. Since the best tests are updated the least, they will likely be read many more times than they are written. The best tests are clear, concise, and simple. Each test should only test one behaviour. The prerequisites, action, and expected result should be clearly expressed such that even someone unfamiliar with the code can understand the test.

A good litmus test for test cases can be checking if a non-technical stakeholder such as the Product Manager understands them.

Accuracy

It's great having durable, reliable, fast, and readable tests, but it is all for naught if the tests continue passing when the system's behaviour changes in a breaking way. However, accuracy doesn't become a focus for many teams because they struggle with the other dimensions of testing. Most teams only write example-based tests, which have the least accuracy, but other methods such as property-based testing and mutation testing can help ensure that our tests are rigorous and improve their accuracy.

Big Test, Little Test

While most people consider tests in terms of unit, integration, and end-to-end tests, Google has a more precise hierarchy of tests: Small, medium, and large. But what do these classifications mean? They might sound subjective, but they each have a specific objective definition.

Small Tests

Google's rules for small tests are:

The test must run on one thread on one machine
The test must not use sleep
The test cannot perform I/O
The test cannot make blocking/async calls

In TypeScript, these translate to:

The test must run in one Node process
The test must run synchronously: it cannot use async/await/promises
It must run in a single tick of the event loop (this means no using setTimeout or setInterval for side-effects)
The test cannot perform I/O (it cannot use synchronous file-system access, for example)

I would also add two more constraints:

The test must not utilise system time
The test must not utilise pseudo-random number generators or randomness of any kind

In practical terms, this rules out testing against a database, mocking asynchronous systems (a cause of much brittleness AND inaccuracy), and even starting an Express server in the test suite.

Medium Tests

Google's rules for medium tests are:

The test may run on multiple threads or processes, but only on one machine
The test may make blocking calls
The test may call localhost, but not the network

In TypeScript, these translate to:

The test may run an additional node process, may utilise a local database, etc
The test may use async/await (Promises)
The test may use multiple ticks of the event loop
The test may perform I/O
The test may call localhost, but not the network

In practical terms, you can now test against a local database, use a tool such as Supertest to test your Express server, and more.

Large Tests

For large tests, all the constraints are removed. An example of a large test would be using Cypress to run end-to-end tests against a staging deployment. As we know, networks are both laggy and unreliable, so these tests have the highest flakiness and slowness.

Comparing Test Sizes

With these constraints in mind, we can see that small tests are the best performing in all categories:

Durability: Small tests necessitate a limited scope, so it will be less likely that they will inadvertently rely on related components
Reliability: The constraints for small tests remove all opportunities for non-determinism to enter the test suite
Speed: Small tests are CPU bound, consume the least resources and are therefore faster than their larger counterparts
Readability: Small tests are generally simpler and thus easier to understand
Accuracy: While small tests are not intrinsically more accurate on their own, their other attributes make it easier for engineers to write and run more of them, more examples, and even use approaches such as property-based testing.

For this reason, we strive to define our test pyramid, not in terms of unit or integration tests, but instead based on this size taxonomy:

As we move further up the pyramid, we lose out on all five factors of a good test. For this reason, we want to structure our code such that this distribution of test sizes is feasible. This code structure is where fDDD can help us!

Testing in Functional DDD

Writing Small tests can be highly challenging unless we take care to structure our code in specific ways. Luckily Functional DDD gives us precisely this structure. From here, I will continue assuming you are familiar with fDDD.

Small Tests: Derivers & Invariants

Since both Derivers and Invariants are Pure Functions, their tests automatically meet the criteria for smallness. This functional purity allows us to test our most valuable code quickly, easily, and to a high standard.

Medium Tests: Controllers

Controllers break the synchrony constraint and so cannot be deemed small tests. However, we can still benefit significantly by using the Partially Applied Controller pattern to reduce dependencies such as a local database, remove network requests, etc. We should strive to make even medium-sized tests as small as possible!

Another way of improving the balance between small and medium-sized tests is to remove tests for Controllers where they provide little value. Testing a Controller that simply flushes the derived result to the database on success is not a very valuable test.

If the test provides almost no value, then it may have a net-negative impact due to the increased brittleness and flakiness it could introduce. Having a solid foundation of small tests makes it easier to remove low-value medium-sized tests.

Large Tests

Large tests are not a concern of fDDD, but we will mention them for completeness. Large tests are most useful for testing configuration, emergent behaviour, and evolutionary architecture against fitness functions.

Types of large tests include:

Automated UI tests on deployed applications
Automated API tests on deployed applications
Load testing on deployed applications
Performance testing on deployed applications

The critical distinction here is that we are testing the entire system in the context of a deployed application. We could implement UI, API, visual regression, and performance testing as medium-sized tests, which would make them easy to run on pull-request, but they wouldn't be testing the system.

However, large tests are expensive, and most teams need to prioritise which non-functional requirements are critical to their applications.

Improving Readability

This article has spoken about flakiness, brittleness, accuracy, and speed, but we haven't talked much about readability yet. Let's discuss a few simple ways we can improve the readability of our tests.

Hermeticity

A test should contain everything it needs for setup and teardown while making no assumptions about the external environment, such as the state of the database.

For example, we once had a flaky test suite that was hard to track down. It was slightly more reliable in CI, but it was extremely flaky on local. It turned out that there were two pieces of code referencing a feature toggle in the database. One of these tests would change the state of the toggle, so the order that the tests were executed in could change the outcome of the test suite! But worse than that, people's local databases often weren't in a state compatible with the test.

In this case, the test's readability was poor because we defined part of the test's behaviour in an entirely separate system. A system that the test code didn't reference.

No logic

Tests should be so simple that they must not use control-flow statements such as if, for, and while. If your test is so complex it could have its own test, it isn't going to be effective, and it certainly isn't going to be clear to another developer when their change breaks your test.

Behaviour-based tests

Each individual test case must test one and only one behaviour. This rule is made clearest with an example:

it('updateUser', () => {  const user: User = { email: 'foo@bar.com' };  const result1 = updateUser(user, { name: 'bronson' });  expect(result1).to.deep.eq({ email: 'foo@bar.com', name: 'bronson' });  const result2 = updateUser(user, { email: 'bar@foo' });  expect(result2).to.deep.eq({ error: 'invalid email address' });});

The test above is trying to test an entire function, rather than a single behaviour of that function. We can improve the clarity of this test by splitting it into two behaviour based tests:

describe('updateUser', () => {  it('should add a name property to an existing user', () => {    const user: User = { email: 'foo@bar.com' };    const result = updateUser(user, { name: 'bronson' });    expect(result).to.deep.eq({ email: 'foo@bar.com', name: 'bronson' });  });  it('should return an error when supplied an invalid email address', () => {    const user: User = { email: 'foo@bar.com' };    const result = updateUser(user, { email: 'bar@foo' });    expect(result).to.deep.eq({ error: 'invalid email address' });  });});

Now if I accidentally break the email validation logic, the test failure cause will be apparent even before I read the test code.

DAMP, not DRY

Google defines DAMP as promoting "Descriptive And Meaningful Phrases" -- a reverse-engineered acronym if I ever saw one. However, the principle is sound; tests benefit more from clarity than code reuse.

In the following example, we have created helper functions to setup test state:

it('should allow users to send friend request', () => {  const users = createTestUsers(2);  const { outcome } = sendFriendRequest({ from: users[0], to: users[1] });  expect(outcome).to.eq('SUCCESS');});it('should not allow a banned user to send friend request', () => {  const users = createTestUsers(2, true);  const { outcome } = sendFriendRequest({ from: users[0], to: users[1] });  expect(outcome).to.eq('BANNED_USER_CANNOT_SEND_REQUEST');  // This test suite is failing});

And then elsewhere in the file:

const createTestUsers = (numUsers: number, ...bannedUsers: boolean[]): User[] => {  const users = Array.from({ length: numUsers }, (_, idx) => ({    email: `user${idx+1}@test.co`,    banned: bannedUsers[idx+1] ?? false,  });  return users;};

The issue is that while reading the test cases, it is unclear what is happening, and is actually dependent on iteration logic inside the helper. Did you spot the bug? Consider instead:

it('should allow users to send friend request', () => {  const user1: User = { email: 'user1@test.co' };  const user2: User = { email: 'user2@test.co' };  const { outcome } = sendFriendRequest({ from: user1, to: user2 });  expect(outcome).to.eq('SUCCESS');});it('should not allow a banned user to send friend request', () => {  const user1: User = { email: 'user1@test.co', banned: true };  const user2: User = { email: 'user2@test.co' };  const { outcome } = sendFriendRequest({ from: user1, to: user2 });  expect(outcome).to.eq('BANNED_USER_CANNOT_SEND_REQUEST');});

Here the difference between test cases has been made explicit within the test cases themselves.

Conclusion

Now that we've defined the five factors of a good test suite, Google's test size taxonomy, the benefits of fDDD in test quality, and a few tips for improving test readability, hopefully, you have a few ideas for improving your team's automated testing. Of course, there is a lot more that one could say about writing great tests, but I will leave that for another article.

Summary:

You can measure the quality of a test suite in terms of brittleness, flakiness, speed, accuracy, and readability
Google break their tests down by a taxonomy of sizes:
- Small: Single machine, single-threaded, synchronous, with no I/O or sleep
- Medium: Single machine, multi-threaded, asynchronous, localhost only
- Large: No constraints
fDDD provides a pattern for implementing a high number of small tests protecting the most valuable business rules
- Use small tests to test derivers and invariants
- Use medium tests to test controllers
- Use partially applied controllers to keep medium-sized tests as small as possible
- Use large tests to test non-functional system factors such as scalability
Readability can be improved by:
- Hermeticity: Remove environmental dependencies from tests
- No logic or control flow in tests
- Writing individual test cases to test an individual behaviour rather than functions
- Writing explicit DAMP test cases where relevant information is repeated rather than abstracted

Functional Domain Driven Design: Simplified

Anthony Manning-Franklin — Sun, 01 May 2022 13:45:28 GMT

Domain-Driven Design (DDD) simplifies the development and maintenance of complex software applications. However, two seminal books on the topic, Domain-Driven Design and Domain-Driven Design Distilled focus on an object-oriented implementation.

How can we translate the tactical patterns of DDD into a functional programming paradigm? This article will show you how I do it in TypeScript and the many simplifications and benefits. Lets begin with a refresher on DDD.

Domain Driven Design

Domain-Driven Design focuses on the business rules, business language, and business problems as the primary focus of software design. Instead of primarily designing software in architectural layers (e.g. database, data transfer objects, models, controllers, views), it focuses on building software with discrete domains or bounded contexts (e.g. billing, authentication, insurance sales, insurance claims, etc.)

DDD aligns our software architecture along the axis of most significant change; the business generates requests for changes to implement new business rules implemented in the domain. Focusing solely on software layers may produce greater code reuse, but reuse inhibits the ability to respond to changing business requirements. If both the Sales and Claims domain use an Insurance Policy entity, then a change on behalf of the sales team could inadvertently create a bug for the claims team. In DDD, we would make a separate SalesPolicy entity and a PolicyClaim entity, thus decoupling these two business domains in the codebase.

Domain-Driven Design splits its patterns into two categories: strategic patterns and tactical patterns. The strategic patterns are the easiest to reuse in any language, framework, and programming paradigm. They are universal aspects of software design.

However, the tactical patterns in DDD are tightly coupled to an object-oriented programming style and influenced by the capabilities of languages prevalent during DDDs conception. These tactical patterns are where we will apply a functional approach to DDD and modify, remove, or create new tactical patterns.

Before we move into Functional implementations of tactical DDD patterns, lets review the strategic patterns in DDD. Feel free to skip ahead if you are already familiar with these concepts.

Strategic DDD Patterns

These patterns govern the overall approach to your code and even your system architecture. They are relevant regardless of the language, framework, or paradigm. Since there is already a wealth of information about these patterns, we will only briefly cover some of the most crucial strategic patterns.

Bounded Context

A Bounded Context is a conceptual boundary in designing a system wherein the meaning of business terms is ubiquitous and consistent. For example, we may have the concept of a Policy entity in insurance. However, different teams in the business will have different interpretations of a policy, i.e. the sales team, the claims team, and the actuarial team will all assign the Policy entity different meanings.

By creating a bounded context in our design, we can effectively isolate each domain from the other. I use the term bounded context (context) and domain interchangeably throughout this article.

Context Map

Context mapping is how we identify, understand, and communicate the Contexts/Domains in our system. You will often identify common entities in context mapping an important tenet of DDD is that we do not attempt to remove or share common entities between contexts. Instead, we allow each context to maintain its implementation and version of the data within its domain.

Anti-corruption Layer

In the above diagram, the "claims context" would implement an anti-corruption layer to adapt the incoming policy data into the format and structure meaningful to the "claims context". This pattern prevents details of the upstream entity from leaking into the downstream context and adds some protection against upstream changes.

Functional shortcomings in OOP DDD

Much of Classical DDD refers to methods of correctly allocating behaviour to the correct entity, aggregate root, domain service, or value object. Many of its rules aim to give DDD practitioners confidence in correctly distributing behaviour amongst various classes in their implementation.

However, Functional Domain Driven Design (fDDD) does not share these problems as data and behaviour are uni-directionally coupled rather than bi-directionally. That is to say that entities in Functional Programming cannot have their own behaviour but instead couples functions to data in the form of parameter types.

Any number of functions can use entities, but functions can only use entities matching their type signature.

Let's take stock of the Classical DDD concepts we have abandoned in fDDD:

Aggregate Root: In fDDD, transactions are bound to the Controller
Value Object: In fDDD, values are just literal values
Service: In fDDD, this is most closely related to the Controller
Domain Service
Factories

Functional DDD Tactical Patterns

Its time to consider the practical implementation of DDDs Tactical Patterns within a Functional Programming paradigm. We have two layers within these patterns: a Functional Core and an Imperative Shell.

The Functional Core implements tactical patterns using only Pure Functions. This layer is where we implement the bulk of our business rules since Pure Functions provide us with the greatest degree of predictability, reliability, testability, and changeability.

In the Imperative Shell, we want to use tactical patterns that help us coordinate between our systems and our business rules. For example, code in the imperative shell may be responsible for retrieving necessary data from the database, checking our invariants, inserting the changes back into the database, and dispatching an email.

Lets start from the top with our Functional implementation of entities, which lives across both Imperative Shell and Functional Core.

Entities

In fDDD, you will commonly implement Domain Entities as type definitions. For example, I might define a Policy in the Sales domain as follows:

type Policy = {  id: number;  customer: Customer;  salesPerson: Staff;  createdAt: Date;}

While this approach to defining the entity is straightforward, it alludes to our implementation of various adaptors either in an anti-corruption layer or in our repositories, as we will cover soon.

A useful example might be our approach to parsing this entity when we receive it across the network. In that case, we may implement a parser function such as:

type PolicyParser = (rawData: unknown) => Policy | ParseError;

This parsing function would create a type-safe run-time implementation for bringing the Policy entity into our domain layer. Rather than implementing these parsers by hand, I often use a run-time typing package like zod.

Invariants: Functional Core

Invariants are Pure Functions with only one job: check that the provided entity meets a given business rule. In the case of an address entity, we might write an invariant that confirms that the provided phone number has an area code matching the address region.

export const validatePhoneMatchesCountry = (address: Address): boolean => {  if (address.phoneNumber ?? false) {    return false;  }  return getCountryFromAreacode(address.phoneNumber) === address.country;}

As you can see, invariant functions can be tiny and very easy to test.

We primarily use Invariants by composing them inside Derivers. We may also, cautiously, create an invariant comprised of other Invariants if many Derivers share the exact composition of Invariants.

Derivers: Functional Core

Derivers are Pure Functions created to support a specific operation, such as createAddress or cancelSubscription. When we call a deriver, we must pass it all the information it requires to either derive the delta for our change or return an outcome indicating the cause for failure.

type UpgradeSubscriptionOutcome = UpgradeSucceeded   | AccountOverdrawn   | InvalidSubscriptionStatus;export const deriveUpgradeSubscriptionOutcome = (  newPlanLevel: Subscription['planLevel'],  subscription: Subscription,   customer: Customer,  upgradePriceMap: PriceMap): UpgradeSubscriptionOutcome => {  if (!validateCustomerBalance(customer)) {    return {      outcome: 'ACCOUNT_OVERDRAWN',      payload: { balanceOwing: customer.balance },    };  if (subscription.status !== 'ACTIVE') {    return {      outcome: 'INVALID_SUBSCRIPTION_STATUS',      payload: {         currentStatus: subscription.status,         expectedStatus: 'ACTIVE'       },    };  }  const prorataDays = calculateProrata(subscription);  const upgradeFee = upgradePriceMap[subscription.planLevel][newPlanLevel];  return {    outcome: 'SUCCEEDED',    payload: {      prorataDays,      upgradeFee,    },  };}

In the Deriver above, we check two potential failure cases and then calculate the change required by the requested upgrade. Consider what we are not doing in this function:

Not getting the customer's details from the database
Not sending the customer a receipt via email
Not calling a payment provider to charge their credit card
Not saving the changes to the database

The core business rules for this operation have been condensed into a single function, enabling us to test every business rule. By keeping the scope of this function narrow, we can maintain its functional purity. This limited scope makes writing unit tests for this function less complex and makes the test itself more reliable. There are no opportunities for non-determinism, no database fixtures, and no stubs or mocks.

Controllers: Imperative Shell

We use Controllers to coordinate asynchronous parts of the system during an operation and call the Deriver. In the above example, the Controller would be responsible for each step we identified as out of scope for the Deriver.

export const upgradeSubscription = async (  customerId: Customer['id'],   newPlanLevel: Subscription['planLevel']): Promise => {  const [customer, subscription] = await Promise.all([    customerRepo.getById(customerId),    subscriptionRepo.getByCustomerId(customerId),  ]);  const { outcome, payload } = deriveUpgradeSubscriptionOutcome(    newPlanLevel,    subscription,    customer,    UPGRADE_PRICE_MAP,  );  switch (outcome) {    case 'INVALID_SUBSCRIPTION_STATUS': {      return { outcome, payload };    }    case 'ACCOUNT_OVERDRAWN': {      await sendRepaymentReminder(customer, payload.balanceOwing);      return { outcome, payload };    }    case 'SUCCEEDED': {      const updatedSubscription = await subscriptionRepo.update({         ...subscription,        prorataDays: payload.prorataDays,        planLevel: newPlanLevel,      });      const transactionId = await chargeCreditCard(customer.defaultCard);      await sendInvoice(        customer,         payload.upgradeFee,         updatedSubscription,        transactionId      );      return {        outcome,        payload: {          ...payload,          transactionId,        }      }    }    default: {      isNever(outcome);      break;    }  }}

As you can see in the above function, we are now dealing primarily with the asynchronous parts of the system. In these controller functions, we want as little business logic as possible. The example above is particularly complex; often, controllers only retrieve data from the database and save on success.

Notice, however, that the Controller does not make any business decisions. It only takes actions based on the result of the Deriver. It does not validate the subscriptions, and it does not check any rules.

However, the actions we take in certain circumstances, such as sending an email or charging a credit card, could be something we want to test. In that case, we can use the Partially Applied Controller pattern.

Testing via Partially Applied Controllers

If we want to make the previous example more testable, we can similarly use a partial function as we might use dependency injection in Object-Oriented programming. Let's take a look at an example:

export const createUpgradeSubscriptionController = (  customerRepo: CustomerRepository,  subscriptionRepo: SubscriptionRepository,  sendRepaymentReminder: (customer: Customer, balanceOwing: number) => Promise<void>,  chargeCreditCard: (card: CreditCard) => Promise'id']>,  sendInvoice: InvoiceSender,) => async (  customerId: Customer['id'],   newPlanLevel: Subscription['planLevel']): Promise => {  /* Remaining implementation is identical to the previous example */};// In an adjacent test fileconst fakeSendRepaymentReminder = sinon.fake.resolves();const fakeChargeCreditCard = sinon.fake.resolves('1a2bc');const fakeSendInvoice = sinon.fake.resolves();const upgradeSubscriptionTestController = createUpgradeSubscriptionController(  customerRepoInMemory,  subscriptionRepoInMemory,  fakeSendRepaymentReminder,  fakeChargeCreditCard,  fakeSendInvoice,);

Now I can supply a test implementation of customerRepo & subscriptionRepo while also providing fakes for the other functions. Since this is a much more complex test, we don't want to repeat ourselves testing the Deriver's business rules. Instead, we only want to assert what functions the Controller calls for each of the Deriver's three possible scenarios.

Repositories: Imperative Shell

Repository patterns for retrieving entities on a CRUD basis are not solely a DDD pattern. Nor is it a significantly changed pattern between Functional and OO paradigms. For the sake of fDDD, I will only mention a few constraints to keep in mind for your repositories:

Keep database concerns constrained entirely to your repository layer
Parse, transform, or map all data types going into and out of your repository layer
A generic repository type could look like this:

type Repository = {   getMatching: (query: Query) => Promise;   getById: (id: string) => Promiseundefined>;   create: (item: Omit'id'>) => Promise;   update: (item: T) => Promiseundefined>;   upsert: (item: T) => Promise;   delete: (id: string) => Promise<undefined | MissingEntityError>; };type Query = {  [k: keyof T]: { operator: Operator, value: unknown },}

Where the implementation of a create function may look like:

const subscriptionRepo: Repository = {  create: async (subscription) => {    const objKeys = Object.keys(subscription);    const result = await pool.query(      `      INSERT INTO "subscriptions"      (`${objKeys.join(',')}`)      VALUES (`${objKeys.map((_, i) => `$${i+1}`).join(',')}`)      RETURNING *      `,      Object.values(subscription)    );    return result.rows.map(parseSubscriptionFromDb)[0];  },  // remaining repo functions omitted};

In the above example, our repository maps from the database type back to the domain entity types using the parseSubscriptionFromDb function. This mapping helps us maintain a layer of separation between the database and our implementation. It also allows us to narrow the types further while throwing errors if the database returns unexpected data.

Software Architectural Context

Now that we have defined the patterns of our fDDD implementation, we need to consider its place in our overall software architecture. If we were to implement fDDD in a typical Express.js application, we could implement it as follows:

Our Route Handler for a POST request to /api/subscriptions may look like this:

export const handleSubscriptionPost: RequestHandler = async (req, res) => {  const customerId = req.jwt.customerId;  const { subscriptionLevel, paymentToken } = req.body;  const { outcome, payload } = await createSubscription(    customerId,     subscriptionLevel,     paymentToken  );  switch (outcome) {    case 'PAYMENT_FAILED': {      res.status(400);      break;    }    case 'SUCCEEDED': {      res.status(201);      break;    }  }  res.json({ outcome, payload });};

All our route handler is responsible for is:

Preparing data from HTTP requests for consumption by the Controller
Mapping outcomes back to HTTP status codes

The route handler has no opinions on the business rules, and the domain code has no knowledge of content types or status codes.

When it comes to planning the structure of our codebase, there are primarily two dimensions we could choose as the basis for our files and folders:

Application layers, e.g. database, route handlers, domain, services
Operations, e.g. createSubscription, updateUser, etc

The former has the advantage that all of the code for a particular area is easy to find, but the trade-off is that implementing a change requires opening many folders. The latter has the advantage that all related code for any given change is likely to be nearby, but it is harder to find other code that may operate similarly or be affected by your change.

When we structure our codebase for a Domain Service, our preference is to keep all of the domain code as closely co-located as possible.

This preference means our top-level folder structure will be along the lines of:

However, we will then split it into "entities" and "operations" sub-folders inside the domain folder.

This structure increases the likelihood of reusing an entity's invariants in different Derivers for each operation.

Wrapping Up

Hopefully, this article has shown you how to implement Domain Driven Design within a Functional Programming paradigm and successfully make complex software more manageable. As you can see, fDDD simplifies many of the complicating factors in a classical DDD implementation. Here's a quick refresher on what we have learned:

Entities are at minimum a type definition and may include a parser for run-time validation
Invariants take an Entity as an input parameter, validate it against a simple business rule, and return a boolean
Derivers take one or more Entities as an input parameter, validate it against operation-specific business rules, compose related Invariants, and return a discriminated union of potential outcomes
Controllers coordinate asynchronous parts of the system, including databases, on behalf of Derivers
The wider application accesses the Domain Layer strictly via Controllers

Are you using map, forEach, and reduce wrong?

Anthony Manning-Franklin — Sun, 20 Mar 2022 09:34:42 GMT

In JavaScript, Array.prototype.map, Array.prototype.forEach, and Array.prototype.reduce are used heavily in functional-style programming. However, I meet many developers missing a clear mental model of when or how to use each function.

This is a problem because we use these functions not just for their behaviour but to communicate our code's intent. These functions reveal crucial details about the author's mental model of the code, the problem, and the solution.

In this article, I will attempt to couple a technical understanding of these functions to a semantic understanding in your mind. As a result, you should have a mental model of when to use these functions and why, plus what it means when you read code using them.

Array.prototype.map

Let's start with a simplified version of the official TypeScript definition for map and break it down. I have trimmed the type parameters down to the simplest and most common use case:

map(  callbackfn: (value: T) => U) => U[];

Above, we have a map function that accepts a type argument U, a callback function, and returns an array (U[]).

The callback function takes a value of type T and returns a U. So it accepts a function that will take a value and transform it into another value.

Map calls this transformation function, the callback, for every element (T) in the array. The callback function transforms T into U, which map uses to transform each element, returning a new array of U[].

Now I have lied to you slightly in the above explanation. Transforming implies a mutative operation. However, map does not and should not mutate anything. Instead, it returns an entirely new array (U[]) where every element has a 1 to 1 relationship with each element of T[]. We call the function map because it maps one type onto another. However, thinking of it as a transformation helps us think about when to use it.

Use .map to "transform" an array of one thing into an array of something else.

I will rewrite the above type definition again in a slightly different format, using intention revealing names for the type parameters. Then we will look at some real-world examples.

Array.map(  callbackfn: (value: InputType) => OutputType) => Array;

This is an identical type definition to the first one I showed you. I've included it because I know that seeing it like this will help some people grok it.

In the next scenario, we have an array of Person objects, and we want to map these into an array of ReactNodes:

type Person = {  firstName: string;  lastName: string;  age: number;};type Props = { people: Person[] };export const PeopleList: React.FC = ({people}) => (      {people.map(      (person) => (        {person.firstName} {person.lastName} | {person.age}
      )    )}  
);

In this example, our map function transforms each Person object from the people array into a list item ReactNode. It's a simple use case and tightly coupled to the context we are using it -- the anonymous function we define for the map function isn't beneficial outside of the PeopleList component since it always returns a

node.

Let's consider a use case where we might reuse our callback function in other contexts. In our next example, we are going to map an array of People into an array of full names:

type Person = {  firstName: string;  lastName: string;  age: number;};function getFullnameFromPerson(person: Person): string {  return `${person.firstName} ${person.lastName}`;}function listPeople(people: Person): string {  return people.map(getFullnameFromPerson).join(', ');}

In the above example listPeople passes the getFullnameFromPerson function to map, but we could conceivably use getFullnameFromPerson in other contexts too.

Let's try a more complicated example now, where we want to create a layer between our database implementation and our TypeScript implementation. There are several things map can help us with:

In Postgres, we use snake_case column names, but in TypeScript and JavaScript, we use camelCase. We'll want to map between these types
Enum types often come from another table; we'll map these too
We'll also map our 64bit integer ids from strings to BigInt

Javascript Types:

type ArticleType = 'GENERAL' | 'REVIEW' | 'EDITORIAL';type Article = {  id: BigInt;  articleType: ArticleType;  author: string;  title: string;  urlSlug: string;  content: string;  publishedAt: Date;}

Database Types:

type ArticleRow = {  id: string; // Bigints come back from pg-node as a string  article_type_id: number;  author: string;  title: string;  url_slug: string;  content: string;  published_at: Date;}type ArticleTypeRow = {  id: number; // small int  type: string;}

Mapping:

Now we have our cast of types, let's consider how we might map from these rather rough database types to these very usable javascript types. I'll write a function that selects the ten most recent articles, and we will use a series of maps to transform the query results.

const getRecentArticles = async (): Promise => {  const result = await pool.query(`    SELECT * FROM articles ORDER BY published_at LIMIT 10;  `);  return Promise.all(result.rows.map(getArticleFromArticleRow));}const getArticleFromArticleRow = async (row: ArticleRow): Promise => ({  id: BigInt(row.id),  articleType: await getArticleTypeById(row.article_type_id),  author: row.author,  title: row.title,  urlSlug: row.url_slug,  content: row.content,  publishedAt: row.published_at,});/**  This function memoises lookups to article type.*/const getArticleTypeById = (() => {  let cache: Map<string, ArticleType>;  return async (id: number): Promise => {    if (!cache) {      const result = await pool.query(`        SELECT * FROM article_types      `);      cache = result.rows.reduce(        (current, previous) => previous.set(current.id, current.type),        new Map()        // This gives us an object where we can lookup article types by id      );    }    return cache.get(id);})();

The first function in the above example, getRecentArticles, simply returns the ten most recently published articles.

If you're wondering why we use Promise.all in the return statement, it is because our mapping callback function getArticleFromArticleRow returns a Promise. That means that the result of result.rows.map(getArticleFromArticleRow) is an array of promises. Still, the caller of getRecentArticles expects a single Promise containing an array of Articles.

Promise.all takes an array of promises and maps them to an array of resolved values within a single promise, thus fulfilling our contract to the caller of getRecentArticles.

Why did we need getArticleFromArticleRow to be asynchronous (return a promise)? Because we look up the article type from the article type enum table in Postgres. You are correct in thinking that a Postgres query for every row in getRecentArticles is wasteful and slow. That is why getArticleTypeById actually caches the value locally.

Looking at the implementation, we use an immediately invoked function expression to create a cache value protected in the closure of the function definition for getArticleTypeById. This closure means no other part of the application can access cache, but all calls to getArticleTypeById utilise the same cache. It is effectively a functional singleton, providing a memoised lookup, so we only have to query the database for ArticleTypes once per node process.

As you can see, Array.prototype.map is a powerfully simple concept. You can build complex transformations with it, create asynchronous processes, build recursive solutions, and even chunk and distribute a map across multiple processes because it is monadic.

Now that we understand map let's see why forEach is different.

Array.prototype.forEach

Let's start with a simplified TypeScript definition of forEach like we did with map:

forEach(  callbackfn: (value: T) => void): void;

This looks pretty similar to map, except that the callback function doesn't return anything, and neither does forEach itself. Why might we want a function that returns nothing for each element of an array? Because we want to create side effects.

Use forEach for creating side-effects beyond the local scope

Let's understand side effects quickly by thinking about its opposite -- a pure function. Pure Functions always return the same value given the same inputs. The synchronous map functions in the previous example were pure, but the asynchronous functions accessing a database were impure because the return value could change between calls.

A function can also have a side effect when it affects part of the system beyond its input or output values. Let's move on to a concrete example.

If we wanted to send an email to every user in an array of users, we would use the forEach function if we do not care about the aggregate outcome of the email sending function. I'll give you an example, where our handleMeetingBooked function will email all attendees:

type Meeting = {  name: string;  attendees: User[];  date: Date;}type User = {  email: string;  name: string;}const handleMeetingBooked = (meeting: Meeting): void => {  const { attendees } = meeting;  attendees.forEach((attendee) => {    sendMeetingEmail(attendee, meeting);  });}const sendMeetingEmail = (attendee: User, meeting: Meeting): void => {  const emailBody = `Hi ${attendee.name},    you have been invited to ${meeting.name}     on ${meeting.date}     with ${meeting.attendees.map(({name}) => name).join(', ')}`;    sendEmail({ to: attendee.email, body: emailBody });}

Notice how no part of handleMeetingBooked cares about what happens inside the callback function provided to attendees.forEach. We choose forEach instead of map because the behaviour is different and because it tells the readers of our code that the application shouldn't care about or depend upon the outcome of this email sending function.

Could I implement identical behaviour with map? Absolutely. Should I? Absolutely not.

Let's look at a typical forEach code smell -- mutating an external collection from within the callback provided to forEach.

// BAD CODE, DO NOT DO THISconst getUsersNames = (users: User[]): string[] => {  const names = [];  users.forEach((user) => {    names.push(user.name);  });  return names;}

What's wrong with the above code? Aside from a few unnecessary lines, it lies to the reader. It tells the person reading this code that the callback function won't change anything within the function, and then it mutates one of the function's variables! It mightn't seem so bad in a small function like this, but this is a recipe for bugs in larger functions.

Code smell: Using .push inside forEach. Consider alternative e.g. .map or .filter

The correct implementation of the above would simply be:

const getUsersNames = (users: User[]): string[] =>   users.map(({name}) => name);

Now that we have understood the difference between map and forEach, it is time to consider the role reduce plays in our code.

Array.prototype.reduce

Reduce is the array function that seems to cause the most confusion. Rather than starting with the type definition, I will offer you my mental model of what reduce is for, and then we can look at the code and see how it supports this way of thinking.

Use reduce to reduce a collection of elements to a single aggregate entity.

That is to say; I use reduce when I have many (an array) and want one (anything but an array). A typical example of this is summing the numbers in an array.

Let's take a look at a simplified version of the official type definition as we did before, this time keeping the goal of reducing down to a single aggregate entity in mind:

reduce(  callbackfn: (previousValue: U, currentValue: T) => U,   initialValue: U): U;

The first thing to notice is that reduce returns a single U where map returned U[]. We can also see that we can supply an initialValue, but the type of this value must match the return type of the reduce function and that the callback function must also return this same type.

I'll write this again with intention revealing names:

Array.reduce(  callbackfn: (previousValue: OutputType, currentValue: InputType) => OutputType,   initialValue: OutputType): OutputType;

We'll reconsider the summation example, equipped with our new mental model and expanded type definition fresh in our minds:

const total = [1, 2, 3, 4, 5].reduce(  (previous, current) => previous + current,  0);console.log(total); // 15

Above, we started with a collection of numbers and wound up with just one number. We reduced it down to its aggregate: the variable total with a value of 15.

How else could we use this? Perhaps we want to reduce an array of users down to a count of common names:

type User = {  firstName: string;  lastName: string;}type CommonNamesAggregate = Record<string, number>;const countCommonNames = (users: User[]): CommonNamesAggregate => {  return users.reduce(    (aggregate, user) => {      const currentCount = aggregate[user.firstName];      if (currentCount) {        aggregate[user.firstName]++;      } else {        aggregate[user.firstName] = 1;      }      return aggregate;    },    {} // We create our new CommonNamesAggregate here  );

In this case, our new aggregate entity is an object where each key is a first name from users, and the value is a count of the frequency of that name in the users array.

Let's write another flavour of the same countCommonNames solution and see if it helps us grok reduce:

const countCommonNames = (users: User[]): CommonNamesAggregate => {  const uniqueNames = new Set(users.map(({firstName}) => firstName));  const commonNamesAggregate = [...uniqueNames].reduce(    (obj, name) => {      obj[name] = 0;      return obj;    },    {}  );  return users.reduce(    (aggregate, user) => {      aggregate[user.firstName]++;      return aggregate;    },    commonNamesAggregate  );

In this implementation, we first create the aggregate object separately and initialise its values to 0 by reducing our set of unique names, saving us from checking if the name already exists in our final reducer.

Now let's revisit forEach regarding the above code example. I'll implement the same behaviour using forEach, and then explain why it is a code smell.

// BAD CODE, DO NOT DO THISconst countCommonNames = (users: User[]): CommonNamesAggregate => {  const uniqueNames = new Set(users.map(({firstName}) => firstName));  const commonNamesAggregate = [...uniqueNames].reduce(    (obj, name) => {      obj[name] = 0;      return obj;    },    {}  );  users.forEach(    (user) => commonNamesAggregate[user.firstName]++  );  return commonNamesAggregate;};

This might look simpler because it has fewer lines of code, but it gives the reader less information about the intent of the code. By using forEach, we tell the reader to hold onto their hats and read carefully because we're about to do something. The issue is that the only clues about what that something is, come from code that is mainly outside and before the forEach callback.

Code smell: Mutating a local variable from within a forEach.

Ultimately, forEach is the array function with the least semantic meaning and the least ability to reveal intentions. As such, forEach should be used as a last resort when there isn't a more suitable function or when you explicitly want to indicate that what occurs within the callback is not relevant to the adjacent code.

I also want to talk briefly about a code smell in reduce usage: returning an array from reduce.

Code smell: Returning an array from reduce. Consider map or filter instead.

People usually do this by mistake when they want to chain .filter and .map together, but instead, they do it in one function. The damage from this anti-pattern is that we mislead readers of our code entirely.

Conclusion

As programmers, we spend a lot of time working with collections of things, whether arrays, sets, maps, or objects. JavaScript comes with a fantastic suite of array functions that can make our days easier and our code more readable. Hopefully, this article has given you the confidence to know which array function to use and why.

TLDR

Use .map to "transform" an array of one thing into an array of something else
Code smell: Mutating a local variable from within a map.
Use forEach for creating side-effects beyond the local scope
Code smell: Using .push inside forEach. Consider alternative e.g. .map or .filter
Code smell: Mutating a local variable from within a forEach.
Use reduce to reduce a collection of elements to a single aggregate entity.
Code smell: Returning an array from reduce. Consider map and/or filter instead.

Your Microservices Are Slowing You Down, could Domain Services Boost Productivity?

Anthony Manning-Franklin — Sat, 19 Feb 2022 01:53:48 GMT

The benefits of a Microservices Architecture are well known; they reduce coupling, allow independent deployments, and increase the rate of change in our applications, making product managers love us, finance teams applaud us, and CEOs offer us big bonuses. Or so I keep being told at conferences.

However the costs of a Microservices Architecture are not talked about nearly as often as they should be. The trade off when we create a new Microservice is increased technical complexity, increased coupling, dependent deployments, greater risk of introducing distributed transactions, and overall a decreased rate of change in our applications.

Wait a minute! The costs sound almost identical to the benefits, so what's going on here? Some of the dogmatic enthusiasm for Microservices is due to our familiar old friend the causal fallacy. Microservices architectures correlate with other beneficial behaviours, up to a point, beyond which their costs exceed their benefits and the approach begins providing negative value.

This benefit is non-linear, the first few Microservices provide a lot of extra value! But these returns quickly diminish, until the Microservices architecture itself becomes a hindrance, in some instances reducing productivity to the point where change is now harder than it was before.

The irony of the negative value phase is that the system has created a feedback loop where the cost to create a new Microservice is low, and the pain is distributed across teams/engineers in the organisation. No one can solve the problem individually, everyone must agree, but whoever continues creating new Microservices reaps the benefits of another team's good behaviour. Microservices architecture can thereby create a system implementing tragedy of the commons. This is how companies wind up with thousands of Microservices, even after the approach shows its flaws.

My goal in this article is not to persuade people that Microservices Architecture is bad. Instead, I want to expose the root cause behind the initial rise in productivity with Microservices, and then find ways to maintain or extend it. My objective is to create an approach that helps us find and stay at the top of the hill where we have the greatest rate of change. I want to give us a set of tools to know when to apply this architectural approach, and how far to go.

The reason Microservices work so well at first is because, when you break up your system into tiny components, it is highly improbable that a single new service with a single responsibility will cross domain boundaries. Your first Microservice was a success, so you build a few more. Along the way you re-allocate these services between teams quite easily as you intuit where the domain boundaries are in your business. So far so good, and quickly you build up tens, then hundreds, and maybe eventually thousands of Microservices, as Uber did.

This is what I call the shotgun approach to Microservices architecture, and it usually occurs due to a lack of design. If you make the pieces small enough, no one has to do the hard design work of identifying bounded contexts or designing domains up front. Instead you can always trade service ownership between teams. The person who championed the approach in the beginning gets lots of praise as the team climbs the hill in the above diagram.

And then the wheels fall off. The top of the hill is a narrow precipice.

Probably around the time you (accidentally) create your first distributed transaction, you realise your Microservices aren't so decoupled, and perhaps you have built a distributed monolith. Every feature that made them great before, now makes them challenging. If you make a change to a service, you can't reasonably tell which other services are directly or indirectly affected. Observability becomes a nightmare, where a single user interaction may be handled by a dizzying number of services talking to each other. What was once function calls in code has now become distributed network requests that must be traced. An outage in one service results in a set of failures that light up your alerting systems like a Christmas tree, making the root cause a nightmare to diagnose.

Whole new solutions are invented to solve problems that teams created for themselves, further increasing complexity and the likelihood of unpredictable emergent behaviour creating incidents of change failure, which themselves become increasingly difficult to solve.

You might be building and deploying small Microservices independently, but the meaningful non-functional requirements live at the system level. You can't reasonably test the suitability of a Microservice deployment in isolation, because the system behaviour is contingent on the whole. As we reduced the responsibilities of a service down to a single purpose, we too reduced it's overall contribution to the -ilities that make our system manageable.

Likewise, Microservices may be deployed independently, but meaningful, value-adding change that the business asks for requires changes to multiple Microservices. With too many services, these changes themselves cascade across services. You've gone from one deployment containing many changes, to one change requiring many deployments.

What's the alternative? Like abstract versus concrete classes, decoupling vs cohesion, the answer is applying things appropriately, and moderately. The key benefit of the first few Microservices is that we create an architecture where teams are empowered to deploy code changes that align with the axis of change in the business. So how do we define moderate and appropriate?

Typically, the first system we split out is one that is obviously discrete, such authentication or payments. The boundary between the new service and the old service is easy to identify: a small surface area of interaction, either side of which the two services are relatively independent. It's easy to get right, hard to get wrong the first time. But taking this first win and going from a crawl to a sprint is ill advised.

Instead, take the time to do planning and analysis before creating your Microservices. This planning typically has to occur sometime after the business has actually begun operating. Planning a Microservices architecture in a greenfield product before achieving product market fit is a sure way to get the bounded context wrong. You're aiming for a fast moving target. During the incubation phase, there's more to be gained by focusing on building differentiators and buying the rest than there is in architecting Microservices.

However with a product in the growth or extraction phase, you can analyse existing code in a well architected monolith, and don't split your services if you can't create a well architected monolith first. Look for clear business domains with very few dependencies on other parts of the code. These are vertical slices of the code you are likely to be asked to change by a product manager, and would be relatively easy to do so in one atomic change.

One technique is to go through Jira comparing past tickets to the code base. The crucial factor here is to align your domains along the axis of change, and our ticketing system is a great way to identify what tends to change discretely and what changes dependently.

You see, the rate of change is determined by the number of dependencies affected by that change. This is true in terms of code AND organisation; that is, your team structure should reflect your business domains which should reflect your code and your architecture. The more these four elements align, the lower the resistance and the faster the rate of change.

The perfect example of this is simply aligning services or teams along the wrong axis. That is, if you have a frontend Microservice, an API gateway Microservice, a few business logic services, and something crazy like a database service. Then imagine that each service had its own team. In this (slightly) contrived example, we have built teams and services around the layers of our application, rather than vertical slices. Since no change can occur without affecting all layers, teams AND services have to communicate a lot in order to deliver change or value.

A Microservices Architecture may start out looking like the diagram on the right, but if you're not careful, the quest for smaller and smaller services will inadvertently lead teams to create something akin to the diagram on the left. Remember, you're optimising for changeability, not reusability. They are two competing goals!

What you ultimately want to do is apply Domain Driven Design to find a small number of discrete, well bounded domains. Then you can structure your teams, code, architecture, and business around these domains.

I believe that finding the correct name for this approach and its services is crucial. The term Microservices seems to imply that more and smaller is better. However as we have seen, discrete and well bounded is far more important than size or number. That is why I have chosen to call them Domain Services*. This name reveals our true objective; aligning teams, code, and architecture with business domains.

My rule of thumb is that the number of Domain Services an engineering department should aim for is the number of feature engineers divided by three or five. That means a team of 10 engineers developing features should have 2 or 3 Domain Services, not 20! A team of 30 feature engineers could have 6 to 10 Domain Services.

This ensures that the services are large enough that they are easy to maintain transaction boundaries within them. The number of discrete domain boundaries that you have to find is reduced, and teams only have to create and maintain a contract/context map for the entities that are communicated across that exposed boundary, keeping communication costs low while greatly enhancing rate of change.

You may have heard that "Microservices should do only one thing!" but I would argue that this only seems true because it implicitly enforces the more accurate rule:

A domain must only be implemented in one service, but a service may implement more than one domain

This matches its sister rule in Domain Driven Design:

A domain must only be worked on by one team, but a team may work on more than one domain

Each Domain Service you add should be planned and executed with extreme caution! Every time you add a service, you increase the number of exposed domain boundaries rapidly. An exposed boundary is a domain boundary that is implemented across the network, between services. Every exposed boundary introduces an API surface that enforces a contract. A contract must be maintained or versioned through changes.

The more boundary exposure you have, the slower your rate of change behind that boundary. Heuristically speaking this means that if your service is too small, the ratio of behavioural code and API code tilts toward API code. As it does the utility of the service approaches zero, since any change in behaviour will more likely result in a change in API. The more Microservices you have, the smaller their behavioural code, and the greater their boundary exposure, soon every Microservice is more API than behaviour and productivity halts.

The reason we began creating Microservices in the first place was to increase the rate of change in behavioural code a benefit we lose as we add more services. The business doesn't ask us to change API code, the business asks us to change the behaviour of the system. The API is meant to help facilitate that change, not hinder it.

For example, two Domain Services can have only one boundary and zero possible dependent services, three services gives us three boundaries, four services gives us up to six, and so on. This is without considering indirect dependencies, which can accumulate even faster.

Services	Boundaries	Indirect Dependencies
2	1	0
3	3	3
4	6	12
5	10	30

The indirect dependencies column describes how a command from one service to another could subsequently depend on the remaining services. That is, if we have 4 services, for each service one of those services could issue commands to, each of those could depend on the two remaining services. When all of those possible dependencies are accounted for at a single depth, we have 12 opportunities for dependent calls.

Of course, these numbers represent the upper bound, the worst case scenario. Importantly they show how too many Microservices quickly turns into a new big ball of mud the very thing we wanted to avoid, just with added network, concurrency, and deployment issues.

As you can see from the above table, the change from two Domain Services to three is almost as important as the change from one to two because a third service introduces new types of complexity. You will likely be carving this new Domain Service out of an existing one, so you must be thorough. Check that the candidate Domain Service doesn't cross transaction boundaries, investigate all the events and commands within the domain, and work with your product manager and/or business analysts to ensure your model of the domain matches theirs.

Modelling domains as a collection of commands, events, and aggregates, entities, or projections helps a lot with this process. Even if the code here is well established, it can be worth running an event storming session.

At this point in the article it is worth running through a definition of commands and events, because understanding the distinction between the two is critical. The simplest definition I have come up with is this one:

A command is a request to change the persisted state; An event is the delta of that change.

That is, every time you would write to a database, whether that is an insert, update, or delete, emit an event which describes what changed (NOT the new state!) An event cannot be rejected because it is a historical statement of fact. It might not be a happy fact, but it is one nonetheless.

Every time you have the intent of changing state, issue a command, knowing that it could be rejected. Commands can have outcomes, which might be the reason the command was rejected, or it might be acknowledgement that the command was accepted. When a command is accepted, an event must eventually follow.

It's important to note that events are the transaction boundary. In an event based system with multiple services, your system has to be and expect eventual consistency.

Let's say your checkout service issues a CompletePurchase command, and checks its simplified inventory data before accepting the command. It then emits a PurchaseCompleted event to the inventory service,which has a much more sophisticated understanding of inventory. At the same time, the inventory service has sent a StockAdjusted event because someone has reported an item as damaged at the warehouse. The inventory service now realises that it can't service the order because the available stock is no longer available. Should it reject the PurchaseCompleted event?

No. The purchase happened, the user's credit card has been charged. This is a matter of historical fact and cannot be rejected. You might say we should have prevented the purchase then, "if only we had kept these domains together!" you lament. But remember, the inventory domain only updated its available inventory count after the damage physically occurred in the warehouse. The real world itself is eventually consistent anyway!

In this example, you can see that the domain events are the true transaction boundary. A PurchaseCompleted event might lead to an ItemAllocated event from the inventory domain in the green path, reducing the available stock count, but the fact that these two events are correlated does not make them an atomic action.

If you model your domains this way, you will find that the domain boundaries are quite clear, and that the events and commands that are shared between them define the API boundaries between our Domain Services.

The reason this distinction between commands and events is crucial is because it relates to another key rule of Domain Services:

Communication between Domain Services should consist of events, not commands

If your Domain Services are issuing lots of commands to each other, this is an indication that your domain boundary is incorrectly placed. A single user interaction should result in a command within a single Domain Service. The primary Domain Service handling the initial command may dispatch one or more events to other Domain Services, which may themselves issue commands to themselves on receipt of the event (but not when replaying the event from their own event store). However these must be events, not commands, and must fail independently.

The result is that a user interaction may be satisfied by a single Domain Service without issuing dependent commands. This does not however, preclude additional responses from the events occurring asynchronously. E.g. if the Checkout Service emits a PurchaseCompleted event then the Notification Service may email the user an invoice based on the event it received and their preferences it has stored. The Dispatch Service may issue a CreateFulfilmentOrder command and begin the process of shipping goods to the user. Each of these commands may fail, but they fail independently.

Let's compare this to a shotgun Microservices Architecture implementation. In the shotgun approach, the CompletePurchase "command" may be a call to a Microservice that validates the cart, which then calls another service to check the billing, while also calling another service to reduce the stock levels, and another to create a carrier label for shipping, another service to update the users order history, and another service to email the invoice. All of these calls may be commands to other services, and all of them may fail in dependent ways which are then (hopefully) fed back to the user.

Coming back to the crux of this article, I am confident that a team of 35 developers working across 7 Domain Services will in the majority of cases, outperform a team of 35 developers working across 60 Microservices. Of course, no quantitative measure exists, but the stories of development teams reducing and reorganising their Microservices provides qualitative evidence to support my hypothesis, in addition to my personal experience, and deductive reasoning outlined in this article.

Ultimately the goal of any architecture is to facilitate ease of change into the future, without sacrificing our target non-functional requirements. This architectural objective is predicated on our ability to predict the nature of changes into the future. Of course, all predictions are imperfect, and increasingly so the further our outlook. However I believe Domain Services better aligns our architecture with our team structure and business structure by utilising tools such as Domain Driven Design to identify numerous independent (or at least loosely coupled) axes of change in our systems.

Summary of Heuristics and Rules

We covered a lot of ideas in this article, so we will quickly recap the key thoughts. I have split these into heuristics and laws.

The heuristics are rules of thumb that are usually true in any software system with "services", whether that is Microservices or Domain Services. I find these are useful ideas to keep in mind when consider the trade offs of introducing new services.

The laws are rules that define best practices for a Domain Services approach, and thus uses the language of RFC 2119

Heuristic of Multi-Service Systems

As new services are added, complexity increases non-linearly
As new services are added, rate of change increases by at most +1
The total rate of change increases as the deployment to change ratio approaches 1:1
The rate of change is reduced by the number of dependent/affected systems/teams for each change
The utility of a service is its amount of behavioural code compared to API code
Increasing the number of services reduces the amount of behavioural code in each service
Increasing the number of services increases the amount of API code in each service

The Laws of Domain Services

A domain must only be implemented in one Domain Service, but a Domain Service may implement more than one domain
A domain must be owned by only one team, but one team may own more than one domain
Domain Services must only communicate asynchronously
Domain Services must communicate via events
Domain Services should not dispatch commands to other services
Events must be an atomic transactional boundary
Events must never be rejected (but they can be ignored)
Commands must be a rejectable request to change the persisted state
Events must be a semantic encapsulation of the delta of altered state, and an immutable historical fact
An accepted command must produce an event
A Domain Service may issue a command to itself when it ingests an event from another Domain Service
Domain Services must not share persistence layers
A Domain Service should only receive data it requires from other Domain Services via events
A Domain Service should not request additional data from another Domain Service in order to handle a command

Footnotes

* Fans of Domain Driven Design, especially those whom follow the Object Orientated implementation, might recognise the term as an implementation level pattern in code. Given the term isn't referenced in DDD: Distilled, seems to cause confusion, and isn't well distinguished from other DDD patterns, I feel that we can make better use of this term at the architecture level. Therefore I have stolen and repurposed it, sorry not sorry.

Deliver a Meaningful Tech Strategy While Still Shipping Features with The Three Stream Backlog

Anthony Manning-Franklin — Sun, 21 Nov 2021 07:15:08 GMT

It's a story as old as time, boy meets girl, girl falls in love with incomprehensible eldritch horror oh wait no, sorry wrong story, this one is about TECH DEBT!

In this painfully familiar story, before every other sprint, stakeholders in the business raise some crucial feature that needs to be developed as quickly as possible. The business proclaims "it's an emergency!" (it isn't), and the team is asked to take on more technical debt and told "don't worry, we'll make time to deal with it later". Every day since has been "today" and not one of them seems to be this day called "later". The engineers have stopped wondering if later will ever come...

Over time, the team's productivity asymptotically approaches zero. Product managers are getting frustrated, engineers are frustrated, stakeholders have given up expecting anything from Product (unless they declare it an emergency), and the tech debt seems insurmountable. Developing new features is neither easy nor enjoyable. Everyone has a different solution, from re-writes to an entire quarter without feature work.

In this article, not only do I offer a viable way out of the morass, but a strategy to ensure things don't get this bad in the first place. It's outstandingly simple, and requires only a little focus to execute on. I call it The Three Stream Backlog

The what now?

If this idea catches on I'm going to really regret not coming up with a better name, but The Three Stream Backlog is dividing your backlog up into well... three streams of work.

Feature Work

This stream contains the epics that Product Managers and the team have poured blood, sweat, and tears into developing and preparing for development. The epics in this stream have been through discovery, user research, user testing, prototyping, tech spikes, technical feasibility planning, etc. The team is confident that these epics are a good use of engineering resources.

We're not going to spend too long talking about this stream because the entire Product Management discipline is devoted to doing this well.

Allocate 50 - 70% of the team's engineering resources to this stream.

Business as Usual

This stream contains only tickets for bug fixes, copy improvements, legal document updates, and minor changes. There are no epics in this stream! It's really important that large features do not try to sneak in.

Allocate 20 - 25% of the team's engineering resources to this stream.

Technical Work

This stream contains both tech debt tickets AND epics. However, these epics are created by engineering leadership with the same rigor, research, planning, and confidence in deliverable value as the epics in the feature stream.

This means engineering leadership need to learn some product management skills and maintain their own backlog. The crucial step here is really stepping back and looking at the big picture. Make sure your tech backlog supports and empowers the team to deliver on the feature roadmap.

Does that mean you need to plan an upgrade to Hasura to support web hooks for the upcoming integration feature? Do you need to implement a repeatable solution to building audit tables alongside the mutable tables that Data & Analytics teams are relying on for their reports? Is the developer experience slowing everyone down and causing "It works on my machine!" bugs? Maybe deployments are too difficult and error prone.

The tech stream is partially recursive, because you will find it contains a stream of big features and smaller bug fixes, just like the three streams themselves. You need to put on your Product Management hat and figure out how to efficiently allocate limited resources to seemingly unlimited problems. This leads me to my next point.

Executing on The Three Streams

The critical detail in implementing this effectively is allocating people to particular streams for entire sprints. Context switching kills focus, kills productivity, and saps our team's morale.

So given a team of 4 engineers, you would allocate them to the 3 streams as such:

Resources	Stream
2x Engineers	Feature
1x Engineer	BAU
1x Engineer	Tech

This focus is the critical detail that lets you plan big, empowering tech epics and deliver serious improvements to the infrastructure, developer experience, and non-functional requirements of the product! You now have dedicated resources each sprint to work on the tech.

What's the alternative? I think we have all tried the "30% of the sprint's story points are for tech debt tickets" approach. Has this ever worked for anyone? The problems with that approach are:

You can't plan significant tech work
Tech debt tickets are prioritised last in the sprint, meaning they get done in a rushed and haphazard way, if picked up at all
You can't predict delivery of tech work
Often, no one is really looking after these tickets or planning the tech backlog

In the end, the story point approach leaves the tech team feeling even more disempowered, and the technical challenges seemingly even more insurmountable.

Tips and Tricks

Win over Product Managers

Product Managers might push back on the idea of them handing over up to 25% of the available engineering resources every sprint for tech to work on their own backlog. However, remind them that this means they don't have to prioritise tech debt tickets ever again. It means the team will deliver faster and more efficiently. It means the team will be able to estimate work better because the code is easier to work with. It means deployments will become more reliable and less stressful.

Try it for a quarter and ensure you deliver a significant technical challenge that improves your ability to deliver Product features reliably. Then, tell everyone that will listen that this was possible because you were able to plan technical work via the three stream backlog and deliver it with uninterrupted focus. When I first introduced it, my team introduced Continuous Deployment for the first time in a business that many thought would never pull it off.

Avoid Stream Fatigue

It's important to swap people around between the feature, BAU, and tech streams. Everyone needs that shared context, and often the things people learn working in one stream will make them more effective in another.

However, be mindful not to break peoples' flow. Let people stay in the tech stream until their technical epic is complete, and same for the feature stream. Since the BAU stream is made up entirely of small tickets, it's easier to swap people in and out of this stream between sprints than the feature and tech streams.

Report on Outcomes with Discipline

As a new Product Manager of the technical backlog, you need to put in place some systems to measure the success of your tech epics. This means your epics need to not only be well formed coming in, but well analysed on the way out. I do this by creating a report in Confluence for each big feature, and include:

What went well
What could be improved next time
What outcomes occurred that I expected (i.e. reduced lead time, increase release frequency, better team health metrics, etc)
What outcomes occurred that I did not expect

When people ask you if this approach is working, you will have a repository of delivered work and measured outcomes to direct them towards.

Process Compatibility

So far I've seen the three streams work best on agile teams running Scrum with sprints. It works decently on Kanban teams, but swapping engineers between streams gets a little messy.

It is not compatible with Shape Up, but then I believe Shape Up is incompatible with life, healthy teams, kittens, and basically anything good and bright in the world.

Stop catching errors in TypeScript; Use the Either type to make your code predictable

Anthony Manning-Franklin — Sat, 23 Oct 2021 13:35:20 GMT

In some languages such as Java, methods or functions can provide type information about the Exceptions or Errors they may throw. However in TypeScript, it is not possible to know what Errors a function may throw. In fact, a function could throw any value, even a string, number, object, etc. This is why TypeScript types caught values as unknown since v4.4.

So what do we do about all the "negative" outcomes of a function call? If I am writing a handleLoginUser function, and the supplied password is invalid, or the account isn't active, how do I implement this effectively in TypeScript?

A lot of programmers who are new to TypeScript might implement these cases as classes extending Error and then throw those classes. This approach has some flaws, let's take a look:

export const handleLoginUser = async (username: string, password: string): Promise => {  if (username === '') {    throw new EmptyUsernameLoginError();  }  if (password === '') {    throw new EmptyPassswordLoginError();  }  if (isCorrectUserPasswordCombo(username, password) === false) {    throw new InvalidCredentialsError();  }  const user = await getUserbyUsername(username);  if (user.active === false) {    throw new InactiveUserError();  }  return user;}

Most of this function is actually about the things that can go wrong, but our types only inform us of the successful path. That means 4/5ths of the function's output is untyped!

The above "exceptions" or "errors" aren't really exceptions or errors at all. They are outcomes. They are predictable, reasonable parts of our system. My heuristic is, if they are something a good product manager would care about, they are not exceptions and you shouldn't throw them!

Exceptions are unpredictable things we cannot reasonably plan for, that the system should not attempt recovery from, and we should not route to the user.

There is an issue in the above code that is less obvious at first glance; If the code calling handleLoginUser is catching these errors and using them to display messages to the user, then there is a real possibility that a serious system issue could get caught in the same error handling code and displayed to the user.

Not only is it a terrible user experience to get a stack trace in the UI, it is a security risk too. Don't make exceptions handling code responsible for both business cases AND exceptions!

Typing the red paths

How could we make these different outcomes more visible in the type system? One option is to build a discriminated union of outcomes. Another, complementary approach, is to use an Either

An Either is a data type that holds some value in a property called left OR some value in a property called right, but never both at once, and never neither. In set theory we would call it a disjoint union, as opposed to the typical union | we might use to create an optional type type Optional = T | undefined.

Let's have a look at how we would define an Either in TypeScript:

type Left = {  left: T;  right?: never;};type Right = {  left?: never;  right: U;};type Either = NonNullable | Right>;

Nothing ground shattering so far, now TypeScript will let us define a left value or a right value but never both:

const validLeft: Either<string, string> = {left: 'foo'}; // validconst validRight: Either<string, string> = {right: 'foo'}; // validconst invalidBoth: Either<string, string> = {left: 'foo', right: 'bar'}; // Invalid, won't compile

Why left and right? What gives? The convention is that left is used for failure cases and the right hand side is used for success cases. The reason is actually that "right" is a pun or synonym for correct.

When Either's are used for success/failure paths, they are called biased Either's. When they hold two potential types for a purpose unrelated to success or failure, they are referred to as an unbiased Either. For the rest of this article we will focus on the biased Either.

Let's add a few helper functions so that we can more easily use the Either type in our handleLoginUser command:

export type UnwrapEither = (e: Either) => NonNullable;export const unwrapEither: UnwrapEither = ({  left,  right,}: Either) => {  if (right !== undefined && left !== undefined) {    throw new Error(      `Received both left and right values at runtime when opening an Either\nLeft: ${JSON.stringify(        left      )}\nRight: ${JSON.stringify(right)}`    );    /*     We're throwing in this function because this can only occur at runtime if something      happens that the TypeScript compiler couldn't anticipate. That means the application     is in an unexpected state and we should terminate immediately.    */  }  if (left !== undefined) {    return left as NonNullable; // Typescript is getting confused and returning this type as `T | undefined` unless we add the type assertion  }  if (right !== undefined) {    return right as NonNullable;  }  throw new Error(    `Received no left or right values at runtime when opening Either`  );};export const isLeft = (e: Either): e is Left => {  return e.left !== undefined;};export const isRight = (e: Either): e is Right => {  return e.right !== undefined;};export const makeLeft = (value: T): Left => ({ left: value });export const makeRight = (value: U): Right => ({ right: value });

Applying `Either` to `handleLoginUser`

Let's take a look at our new implementation of handleLoginUser when we return an Either instead of throwing:

type LoginError =  | 'EMPTY_USERNAME'  | 'EMPTY_PASSWORD'  | 'INVALID_CREDENTIALS'  | 'INACTIVE_USER';export const handleLoginUser = async (username: string, password: string): Promise => {  if (username === '') {    return makeLeft('EMPTY_USERNAME');  }  if (password === '') {    return makeLeft('EMPTY_PASSWORD');  }  if (isCorrectUserPasswordCombo(username, password) === false) {    return makeLeft('INVALID_CREDENTIALS');  }  const user = await getUserbyUsername(username);  if (user.active === false) {    return makeLeft('INACTIVE_USER');  }  return makeRight(user);}

The first thing you should notice is that we have types for every possible case in our function. The great thing about this is that now the caller can see every possible outcome in the function return type. It's clear to the caller that they will get a User object if the function succeeds, or one of 4 possible failure types.

Let's have a look at the caller code:

app.post('/login', async (req, res) => {  const { username, password } = req.body;  const loginEither = await handleLoginUser(username, password);  if (isRight(loginEither)) {    const user = unwrapEither(loginEither);    res.json({ user });    return;  }  const error = unwrapEither(loginEither);  switch (error) {    case 'EMPTY_USERNAME': {      res.json({ error: 'You must supply a username to login' });      return;    }    case 'EMPTY_PASSWORD': {      res.json({ error: 'You must supply a password to login' });      return;    }    case 'INVALID_CREDENTIALS': {      const attemptsRemaining = await handleInvalidCredentialsAttempt(username);      if (attemptsRemaining === 0) {        res.json({ error: 'You have made too many attempts and been locked out.' });        return;      }      res.json({        error: `Invalid username and/or password, you have ${attemptsRemaining} attempts remaining`,      });      return;    }    case 'INACTIVE_USER': {      res.json({ error: 'Check your email for an activation link' });      return;    }    default: {      isStrictNever(error);    }  }});

The above code is much more maintainable. We get great type hinting in our switch statement, and exhaustiveness checks. Meanwhile, the green path is lean.

Of course, one of the great things about this approach is that we could clearly refactor these into a handler function for the success case, and a handler function for the failure case if we wanted to. In fact, let's do that now:

export const handleLoginError = async (loginError: LoginError): Promise<string> => {  switch (error) {    case 'EMPTY_USERNAME': {      return 'You must supply a username to login';    }    case 'EMPTY_PASSWORD': {      return 'You must supply a password to login';    }    case 'INVALID_CREDENTIALS': {      const attemptsRemaining = await handleInvalidCredentialsAttempt(username);      if (attemptsRemaining === 0) {        return 'You have made too many attempts and been locked out.';      }      return `Invalid username and/or password, you have ${attemptsRemaining} attempts remaining`;    }    case 'INACTIVE_USER': {      return 'Check your email for an activation link';    }    default: {      isStrictNever(error);    }  }}

This function will handle our error cases and return an error string for the user. Let's implement the improved function in our route handler:

app.post('/login', async (req, res) => {  const { username, password } = req.body;  const loginEither = await handleLoginUser(username, password);  if (isRight(loginEither)) {    const user = unwrapEither(loginEither);    res.json({ user });    return;  }  const loginError = unwrapEither(loginEither);  res.json({ error: await handleLoginError(loginError) });});

That route handler is pretty easy to read at a glance now, and we've removed the mixed responsibilities and mixed levels of abstraction. The route handler maps the request payload to the command handler handleLoginUser, and then maps the outcomes to responses. It now has no business logic of its own, it is not responsible for managing attempts or checking passwords.

Our type system gave us the confidence we needed to easily refactor this code. It's easy to test, and we have managed to decouple our success types from our failure types. This means we don't always have to handle the entire discriminated union of possible outcomes, changing the error types only affects left sided code, and likewise changing the User type only affects right sided code.

Try out the Either pattern in your own functional TypeScript code and let us know how you went in the comments!

Strict & Weak Exhaustive Checks in TypeScript: Nuke your app at runtime for fun and profit!

Anthony Manning-Franklin — Sun, 17 Oct 2021 02:34:40 GMT

This article assumes you are familiar with the never type, exhaustive checks, and the concept of failing fast. If you aren't yet, or want a refresher, here are some resources to get up to speed:

Now I'm going to introduce you to two functions I wind up bringing into most TypeScript applications I work on, isStrictNever and isWeakNever

isStrictNever

export const isStrictNever = (x: never): never => {  throw new Error(`Never case reached with unexpected value ${x}`);};

This function will throw if it is ever called at runtime, and our compile-time checks ensure that this function should never be called.

isWeakNever

export const isWeakNever = (x: never): void => {  console.error(`Never case reached with unexpected value ${x} in ${new Error().stack}`);};

This function will only log at runtime, while still providing compile-time checks ensuring that this function should never be called.

Using `isStrictNever` to fail fast

Out of the two functions, isStrictNever is preferred in most cases because it helps us fail fast. One place I always use this is if I'm performing any mapping or checks at the database layer. Data consistency at runtime is key, and can easily cause your application to fall into an unpredictable state if the system attempts to continue operating with bad data.

Here's an example:

type Article = {  author: string;  body: string;  title: string;  type: ArticleType;  publishedAt: Date;};type ArticleType = 'STANDARD' | 'SERIES';export const selectLatestArticle = async (): Promiseundefined> => {  const res = await pool.query(`    SELECT id, author, title, body, type_id, published_at     FROM articles    ORDER BY published_at    LIMIT 1;  `);  return res.rows.map(mapArticleRowToArticle).shift();};type ArticleRow = {  author: string;  body: string;  title: string;  type_id: ArticleTypeId;  published_at: Date;};type ArticleTypeId = 1 | 2;const mapArticleRowToArticle = ({published_at, type_id, ...row}: ArticleRow): Article => ({  ...row,  publishedAt: published_at  type: mapArticleTypeIdToTypeString(type_id),});const mapArticleTypeIdToTypeString = (typeId: ArticleTypeId): ArticleType => {  switch (typeId) {    case 1: {      return 'STANDARD';    }    case 2: {      return 'SERIES';    }    default: {      return isStrictNever(typeId);    }  }};

Now, if I had a new article type id in the database, but forget to update the code to match, my application will crash whenever it retrieves an article using this new type.

You might think that sounds worse, but it's actually much better. This bug will likely cause end to end tests to fail completely. Because we failed fast, this change is unlikely to make it to production. Even if it does, we can easily track 50x coded HTTP responses in our observability system, we raise alerts when error rates spike, meaning our time to detection and time to recovery will be fast.

If instead we had simply let mapArticleTypeIdToTypeString return undefined, the impact might have been less obvious: it likely would have gone to production, and we would be dealing with disappointed product managers, a product that looks less professional, and have to prioritise bug tickets against new feature work. Given the option, I would much rather fail and fix up front.

Not to mention the fact that latent bugs are vectors for new bugs to interact and cause additional unexpected behaviours, further undermining the reliability of our application.

But a simple isStrictNever can save us all of that pain. So why would we ever use isWeakNever?

Using `isWeakNever` at the intersection of third party domains

Sometimes, we don't fully control the flow of data nor the implementation of types through our system. In these cases, we can use isWeakNever to create type systems that still help us enforce effective compile time checking for the parts of code we do control, while passing through unexpected runtime types from the code we do not control.

I often use this in a redux reducer, for example:

type CountState = {  count: number;};const initialState: CountState = {  count: 0,};type IncrementAction = {  type: 'INCREMENT';}type DecrementAction = {  type: 'DECREMENT';}type CountAction = IncrementAction | DecrementAction;export const reducer = (state: CountState = initialState, action: CountAction): CountState => {  switch (action.type) {    case 'INCREMENT': {      return { count: state.count + 1 };    }    case 'DECREMENT': {      return { count: state.count - 1 };    }    default: {      isWeakNever(action);      return state;    }  }};

Now if I introduce a new count action:

type SquareAction = {  type: 'SQUARE';};type CountAction = IncrementAction | DecrementAction | SquareAction;

But neglect to add a new case to the reducer, I will receive a compile time error because action has a possible value in the default case.

This compile time check gives me certainty that my implementation has completeness, while allowing the app to continue running when Redux sends its @@/INIT action, a middleware dispatches an action, or even if we have multiple reducers combined together.

Yes, the type system is lying to us a little, but it is a known, controlled, and measured lie. Logging the uses of isWeakNever gives us visibility into the things that slip through.

While Phryneas (AKA Lenz Weber) levies some valid criticisms towards the discriminated union approach to Redux reducers in his article, I believe this approach alleviates his concerns, while making the holes in the type system much more explicit.

This approach also enforces completeness through exhaustive checks, while his approach leaves these checks behind only to focus on payload type hinting.

In my opinion, exhaustive checks AND payload type hinting together are worthwhile, given an explicit indicator that the type system is incomplete, as usage of isWeakNever does.

Have another opinion on the matter? Share your thoughts in the comments below!

Dev Flags: Supercharge your Continuous Deployment by Dropping Database Feature Toggles

Anthony Manning-Franklin — Sun, 03 Oct 2021 03:39:16 GMT

Those of us who use Trunk Based Development as the foundation of our approach to Continuous Delivery or continuous Deployment are familiar with the use of Feature Toggles to prevent work in progress changes becoming prematurely available to customers while we develop them. To use the ThoughtWorks vernacular; release toggles.

A mistake I have seen many teams make, and regret, is managing these release toggles from the database. This approach has several issues:

Every release must test both paths, as the toggle could change at any time
Developers working on the code base have less visibility into which flags are currently enabled in which environments
Product Managers, developers, and testers often have to coordinate on the removal of feature toggles, this often leads to toggles simply not being removed
Following on from above, old feature toggles often stop being tested. This means every stale feature toggle is a liability that could cause an incident.
A full history of changes is either lost or needs tooling built to support it

Sure, many of the above issues could be solved with good tooling and strict processes, aside from #1. However all of these issues can be solved with tools your team has now: Source control & code.

By putting your toggles in code, you get the following advantages:

Changes only need to test the active code paths in each release, as they cannot change without another release
Changes to toggles have to go through the full SDLC/release pipeline and thus we can be confident they are working as expected
Developers can clearly see what features are enabled in what environments, who worked on them, and when they were changed
Developers can easily remove toggles once they have finished with them, as part of their feature clean up.

Because this approach seriously empowers developers to manage these toggles themselves, I believe this approach needs its own term: Dev Flags

The advantage of this name is that it is clear who is responsible for them: Developers! This is just part of how we get work done, Product managers don't need to be involved except to confirm they would like a new feature to go live in production.

This clearly delineates the costs involved in maintaining a feature toggle for other purposes, such as A/B Testing, Ops Toggles, and Canaries. By using two seperate systems, we can be clear with product managers, maintaining a new toggle will incur extra overheads on our time, impact our release cycles, and introduce an element of fatigue and complexity over time. This can help us ensure we collaborate on an effective plan to manage the lifecycle of our feature toggles when they are needed, and remove them as soon as they are no longer necessary.

How to implement Dev Flags? In projects using TypeScript for frontend and backend, I like to include these in a shared package. My folder structure might be:

And the contents of devFlags.ts:

type RuntimeEnvironments = 'dev' | 'stg' | 'prd';type DevFlags = {  useNewProfileScreen: boolean;  usePhoneCheck: boolean;};type DevFlagsEnvironmentMap = Record'stg'>, DevFlags>;const getEnvironmentDevFlagSet = (env: RuntimeEnvironments): DevFlags => {  const envs: DevFlagsEnvironmentMap = {    dev: {      useNewProfileScreen: true,      usePhoneCheck: true,    },    prd: {      useNewProfileScreen: true,      usePhoneCheck: false,    },  };  const flagEnv = env === 'stg' ? 'prd' : env;  return envs[flagEnv];};export const DEV_FLAGS = getEnvironmentDevFlagSet(env.RUNTIME_ENVIRONMENT);

Why do we exclude staging and instead treat it as prod? Because staging is where we confirm that the current deployment artefact is suitable for release to production. We can't do that if we're executing different code paths in staging and production.

In this one file I can see what features are in development, where they're active, who added which lines, and I can lookup usages to find these flags in the codebase.

As a rule of thumb, you want the number of dev flags in code to be equal to or less than the number of developers working in the code base. This requires some discipline to ensure you remove the flag after releasing the feature. Advantageously, developers are motivated to remember to remove flags because we like to keep our code clean, and we're empowered to do it, because it just another merge, same as we do all day.

Will you try Dev Flags with your team? Does your team do something else? Does it work for you? Let us know in the comments!

Cover photo: The Twin Jet Nebula, or PN M2-9, is a striking example of a bipolar planetary nebula. Bipolar planetary nebulae are formed when the central object is not a single star, but a binary system, Studies have shown that the nebulas size increases with time, and measurements of this rate of increase suggest that the stellar outburst that formed the lobes occurred just 1200 years ago.

Sagas & Event Sourcing

Anthony Manning-Franklin — Sun, 05 Sep 2021 12:20:25 GMT

By removing the constraint of atomicity for a transaction, we can break up a long lived transaction T into several transactions: $T_{1},T_{2},T_{3},T_{n}$

In order to maintain consistency, we must also provide compensatory transactions for failure cases: $C_{1},C_{2},C_{3},C_{n}$

In the context of Event Sourcing, this is an argument against achieving idempotent events through stateful payloads instead of delta payloads. To make this clear, consider the following two sets of event sequences:

Stateful Payloads

#	Event Type	Payload	Current State
1	Add	`{balance: 3}`	`{balance: 3}`
2	Add	`{balance: 5}`	`{balance: 5}`
3	Add	`{balance: 9}`	`{balance: 9}`
4	Subtract	`{balance: 6}`	`{balance: 6}`

Stateful payloads offer limited idempotency: if I receive the same payload twice, the balance remains the same. However, each event must take a full lock, as each event is dependent on the state of the last event. If two events are written concurrently, they will destructively interfere with each other. Because of this, stateful payloads:

Constrain write throughput
Have destructive compensatory events
Are inherently idempotent if sequentially constrained, provided events are never received out of order (e.g. receiving $E_{3}$ multiple times is non-destructive, unless we receive it again after receiving $E_{4}$ ).

The compensatory events are destructive, because if I were to write a compensatory event for $E_{2}$ at the time $E_{2}$ occurred it would be {balance: 3}. Applying $C_{2}$ after $E_{4}$ results in a state of {balance: 3} instead of {balance: 4}.

Stateful payloads are not a practical solution to idempotency because where there is more than once delivery, there is out of order delivery. Instead, make the consumer idempotent by keeping a log of processed events.

Lets consider the alternative, events that describe what delta to apply to the state to derive the new state.

Delta Payloads with Non Sequential Events

#	Event Type	Payload	Current State
1	Add	`{amount: 3}`	`{balance: 3}`
2	Add	`{amount: 2}`	`{balance: 5}`
3	Add	`{amount: 4}`	`{balance: 9}`
4	Subtract	`{amount: 3}`	`{balance: 6}`

As we can see the state at each step is the same. Now weve lost the limited idempotency of stateful payloads, but gained:

Improved write throughput
Non-destructive compensatory events

Lets create a compensatory event for each of these events:

Compensatory Events

#	Event Type	Payload
1	Subtract	`{amount: 3}`
2	Subtract	`{amount: 2}`
3	Subtract	`{amount: 4}`
4	Add	`{amount: 3}`

If I want to rollback $E_{2}$ I simply apply $C_{2}$ to the event stream, resulting in

#	Event Type	Payload	Current State
1	Add	`{amount: 3}`	`{balance: 3}`
2	Add	`{amount: 2}`	`{balance: 5}`
3	Add	`{amount: 4}`	`{balance: 9}`
4	Subtract	`{amount: 3}`	`{balance: 6}`
5	Subtract	`{amount: 2}`	`{balance: 4}`

This only applies to non-sequentially constrained event streams. As soon as a sequentially significant event is applied, the entire event stream becomes sequentially constrained, even if the other events are non-sequential on their own.

Here we can see set $E_{1},E_{2},E_{3},E_{4},C_{2}$ is equivalent to set $E_{1},E_{3},E_{4}$

This is because any sequence of $E_{n}$ is equivalent to any other sequence of $E_{n}$, and so is its complimentary set of $C_{n}$ and thus the set of any combination of the members of the two sets combined also be equivalent.

Delta Payloads with Sequential Events

#	Event Type	Payload	Current State
1	Add	`{amount: 3}`	`{balance: 3}`
2	Add	`{amount: 2}`	`{balance: 5}`
3	Add	`{amount: 4}`	`{balance: 9}`
4	Divide	`{amount: 3}`	`{balance: 3}`

Consider applying $C_{2}$ to the above event stream.

#	Event Type	Payload	Current State
1	Add	`{amount: 3}`	`{balance: 3}`
2	Add	`{amount: 2}`	`{balance: 5}`
3	Add	`{amount: 4}`	`{balance: 9}`
4	Divide	`{amount: 3}`	`{balance: 3}`
5	Subtract	`{amount: 2}`	`{balance: 1}`

However, our objective is to wind back the effects of $E_{2}$, so the desired outcome is

#	Event Type	Payload	Current State
1	Add	`{amount: 3}`	`{balance: 3}`
3	Add	`{amount: 4}`	`{balance: 7}`
4	Divide	`{amount: 3}`	`{balance: 2.3}`

In this case, we cannot guarantee a compensatory event will be non-destructive unless we can guarantee no sequential events have been inserted between $E_{2}$ and $C_{2}$. With sequential events in the mix, we can only make that guarantee if we lock the event stream, constraining write throughput and preventing entries between $E_{2}$ and $C_{2}$ to regain atomicity.

This leads us to the following assertions:

Compensatory events are non-destructive if both set $E_{n}$ and set $C_{n}$ are non-sequential.
$E_{n}$ is non-sequential if $E_{1},E_{2},E_{3}$ is equivalent to $E_{2},E_{1},E_{3}$ and any other combination.
A single sequential event in an event stream breaks the non-destructive compensation guarantee.

Caveats

Event stream sequentiality is derived from the combination of reducer and events. This means you could create a second reducer that processes events such that they are no longer non-sequential.
- If $R_{1}(E_{n})$ is non-sequential, you cannot guarantee $R_{2}(E_{n})$ is also sequential.

Practical Applications

When designing event sourced systems with long lived transactions modelled as sagas, you gain non-destructive compensatory events by ensuring your events are non-sequential. You only need to maintain this non-sequentiality constraint for events in the same stream processed by the same reducer. This means the projection derived from $R_{1}(E_{n})$ is unaffected by sequential events in stream $E_{b}$, ergo $R_{2}(E_{n+b})$ cannot non-destructively handle compensatory events in stream $E_{n}$ while $R_{1}(E_{n})$.

In a practical sense, reverting a sales order might re-apply deducted funds from a users account $R_{1}(E_{n})$ is non-destructive but it wont put an item back into inventory if it has already shipped $R_{2}(E_{n+b})$ is destructive.

The key takeaway here is to be mindful of your events are they sequential? If so, is it because there is some stateful information in the event payload that should be refactored into a delta? If it cannot be turned into a delta, you will need to consider the business implications of your compensatory event, i.e. booking a return parcel for a customer, dispatching an email, etc. It often becomes a matter of invoking a compensatory command or saga within the destructively compensated domain.

Functional Singletons in TypeScript With Real Use Cases

Anthony Manning-Franklin — Sun, 22 Aug 2021 06:28:51 GMT

Singletons are commonly used in Object Oriented Programming when we want to enforce that there is only ever a single instance of a class. This might be because we are trying to encapsulate some global state between processes.

In this article, I will use the queue from my es-reduxed package. The purpose of this queue is to:

Store a list of unprocessed event ids
Maintain the correct order of events
Ensure an event is only processed once

These assurances must be made because there is no guarantee that the events will be sent from the subscription only once and in order. While the system is processing one event, it may receive several more.

As you can guess, if we were to wind up with two instances of the event queue, we could no longer guarantee order and single processing constraints.

Let's take a look at the queue itself, starting with the types:

export type Queue = {  enqueue: (id: number) => void;  registerPromise: (id: number, resolver: PromiseResolver) => void;};export type PromiseResolver = (value: Store<any, any>) => void;

So we can see a Queue is just an object with two properties:

enqueue: a function taking a number and returning void
registerPromise: a function taking a number and a function

The registerPromise function is just associating a resolve function from a Promise with an event id so that the queue can resolve the promise with the given redux state when it finishes processing that event.

Let's take a look at the queue implementation:

/** * This queue system uses a recursive loop and a primitive state machine to * ensure that events are dispatched to redux in exactly the order they were * received. */const startQueue = extends EventBase>(  reduxStore: Store<any, any>,  eventsRepo: EventsRepo) => {  const queue: number[] = [];  const dedupeSet = new Set<number>();  const promiseMap = new Map<number, PromiseResolver>();  let state: 'READY' | 'PROCESSING' = 'READY';  const processEvent = (event: EventBase) => {    reduxStore.dispatch(event);    if (event.id === undefined) {      throw new Error(`Malformed event is missing id: ${event}`);    }    const resolver = promiseMap.get(event.id);    if (resolver) {      resolver(reduxStore.getState());      promiseMap.delete(event.id);    }  };  const processQueue = async () => {    if (state === 'READY') {      queue.sort((a, b) => a - b);      const eventId = queue.shift(); // So we only process if something was in the queue      if (eventId) {        state = 'PROCESSING';        if (queue.length) {          // More than one event in queue, so do bulk processing          const lastEventIndex = queue.length - 1; // Save queue length in-case it changes during the await          const lastEventId = queue[lastEventIndex];          const events = await eventsRepo.getEventRange(eventId, lastEventId);          events.forEach(processEvent);          queue.splice(0, lastEventIndex + 1);        } else {          const [event] = await eventsRepo.getEvents(eventId - 1, 1);          processEvent(event);        }        state = 'READY';        processQueue();      }    }  };  return {    enqueue: (id: number | string) => {      const idCoerced = typeof id === 'string' ? parseInt(id, 10) : id;      if (!dedupeSet.has(idCoerced)) {        dedupeSet.add(idCoerced);        queue.push(idCoerced);        processQueue();      } else {        console.warn(`Out of order event: [${idCoerced}]`);      }    },    registerPromise: (id: number, resolve: PromiseResolver) => {      promiseMap.set(id, resolve);    },  };};

Okay, let's break this down:

const startQueue = extends EventBase>(  reduxStore: Store<any, any>,  eventsRepo: EventsRepo) => {

The first thing to notice here is that we do not export this function, startQueue is available only inside the queue.ts module. This is a critical point we will come back to later.

  const processEvent = (event: EventBase) => {    reduxStore.dispatch(event);    if (event.id === undefined) {      throw new Error(`Malformed event is missing id: ${event}`);    }    const resolver = promiseMap.get(event.id);    if (resolver) {      resolver(reduxStore.getState());      promiseMap.delete(event.id);    }  };

This function is defined inside startQueue, so it is only available within the startQueue function. This is similar to a private method. However, it has access to all variables included in its closure. In this case, we are making use of promiseMap and reduxStore. This is similar to private properties in a class, but we use closures to make them inaccessible outside this context.

  const processQueue = async () => {    if (state === 'READY') {      queue.sort((a, b) => a - b);      const eventId = queue.shift(); // So we only process if something was in the queue      if (eventId) {        state = 'PROCESSING';        if (queue.length) {          // More than one event in queue, so do bulk processing          const lastEventIndex = queue.length - 1; // Save queue length in-case it changes during the await          const lastEventId = queue[lastEventIndex];          const events = await eventsRepo.getEventRange(eventId, lastEventId);          events.forEach(processEvent);          queue.splice(0, lastEventIndex + 1);        } else {          const [event] = await eventsRepo.getEvents(eventId - 1, 1);          processEvent(event);        }        state = 'READY';        processQueue();      }    }  };

Here we continually process the queue in a recursive loop, as long as there are events remaining and the queue is not already processing events. This ensures we only process events once. Because this function is async (it returns a promise), and because it will always call await before it recursively calls processQueue again, this function will not lock the event loop.

It's also important to realise that there might be calls to enqueue during the await step. This would grow the queue, but not be included in the call to getEventRange, so this is why we save the last event index before we await; otherwise, we could accidentally splice out events we hadn't processed yet.

Now that we understand the queue's "private methods" and properties, let's take a look at the "public methods" in the return statement:

  return {    enqueue: (id: number | string) => {      const idCoerced = typeof id === 'string' ? parseInt(id, 10) : id;      if (!dedupeSet.has(idCoerced)) {        dedupeSet.add(idCoerced);        queue.push(idCoerced);        processQueue();      } else {        console.warn(`Out of order event: [${idCoerced}]`);      }    },    registerPromise: (id: number, resolve: PromiseResolver) => {      promiseMap.set(id, resolve);    },  };

Aha! So when we enqueue an event, we call processQueue if it is an event we haven't seen before. Remember the state checks in processQueue? We can safely call this function here because it won't do anything if there is already a process running, and it will eventually get to our enqueued event through the recursive loop.

Meanwhile, registerPromise maps a promise to the event id, which will be used later. In this implementation, we only allow resolving one promise per event processed.

Let's get to the meat of this article, the function that instantiates or retrieves this queue as a singleton:

export const getQueue = (() => {  let instance: Queue;  return extends EventBase>(    reduxStore: Store<any, any>,    eventsRepo: EventsRepo  ) => {    instance =      instance === undefined ? startQueue(reduxStore, eventsRepo) : instance;    return instance;  };})();

First, take note of the fact that this is the only run-time export from queue.ts. You can only retrieve a queue by calling getQueue, but what's actually happening here?

export const getQueue = (() => { // Trimmed})();

Above is an immediately invoked function expression. We are defining a function and then calling it. The result of this function is then assigned to the variable getQueue. Well, getQueue is a function -- we can tell because the first word is a verb -- so this immediately invoked function expression needs to return a function.

(() => {  let instance: Queue;  return // Trimmed})();

Before it returns, we declare a mutable variable called instance of the type Queue but do not define a value for it, which means it will be undefined.

(() => {  let instance: Queue;  return extends EventBase>(    reduxStore: Store<any, any>,    eventsRepo: EventsRepo  ) => {    // Trimmed  };})();

Aha! So after we declare our instance variable for storing a queue, we define a function for our immediately invoked function expression to return. This function signature should look familiar -- it is identical to startQueue's function signature.

(() => {  let instance: Queue;  return extends EventBase>(    reduxStore: Store<any, any>,    eventsRepo: EventsRepo  ) => {    instance =      instance === undefined ? startQueue(reduxStore, eventsRepo) : instance;    return instance;  };})();

The final step: Inside the function returned to getQueue by our immediately invoked function expression, we will: If instance is undefined, call startQueue, passing in our generic type parameter, reduxStore, and eventsRepo, OR, in the case that it is defined, we simply leave instance as instance. Then we return instance.

This means that instance is stored in the closure scope of getQueue. It is no longer accessible anywhere in javascript except by getQueue itself. We can be confident that getQueue will only ever return a single queue instance. The first time it instantiates the queue, and subsequent calls return it.

You can test this fact by asserting:

expect(getQueue()).to.equal(getQueue());

This will return true because both calls return a reference to the same object!

console.log(getQueue() === getQueue()); // true

Remember object equality in javascript compares references.

There's a massive caveat in this implementation; can you spot it? We only use the parameters the first time getQueue is called! This means that if I try to start one queue for one store and another queue for another store, it will simply ignore the second store and return my queue for the first store.

In this way, this pattern is not memoization. If we were using memoization, we would be able to create a singleton per combination of reduxStore and eventsRepo. However, this makes it harder to enforce a singleton pattern. What defines whether reduxStore and eventsRepo are the same as the last call? Would we compare object equality? Deeply nested properties?

For example, if I were to use a proxy-based memoization implementation such as proxy-memoize then changes to properties of, or child properties of reduxStore and eventsRepo would cause calls to getQueue to return a new queue instance! Uh oh!

I like to think of this pattern as brutal memoize. You get to provide your parameters once, and after that, we ignore them.

Will you be using this functional singleton pattern in your code? Do you have a different approach? Let me know what you think in the comments!

Photo by Michael Dziedzic on Unsplash

Why You Will Never Write Another "Down" Migration

Anthony Manning-Franklin — Thu, 12 Aug 2021 13:56:41 GMT

If you've used frameworks like Rails, Django, or even Hasura, you are probably familiar with the concept of "up" migrations and "down" migrations. These might also be called forwards and backwards, or in flyway they are called "undo" migrations.

For the unfamiliar, the migrations we are talking about here are a set of schema changes for a relational database such as Postgres, MySQL, etc. Sometimes these are generated by an ORM based on models, and sometimes they are written by developers. They're almost always (and should be) committed to version control systems, and must be executed in a particular order.

Our "up" migrations might create some tables, alter some columns, set constraints, and so forth. This is normally a straight-forward affair. Developers do this day in and day out, we test these on our local before committing them, and if we're going to run alterations on existing tables, we usually test them out on prod-like data to measure impact. We're confident about up migrations.

My case against down migrations starts rather predictably; when do you run them? Hardly ever. How confident are you that every down migration in source control was tested before it was executed? Was it tested after prod-like data was inserted into the table?

I know the answer to both those questions is "Oh s#%T!" or an amused but panicked chuckle to yourself. The fact is that when it comes time to fix a schema problem in production, those down migrations you've been saving for a rainy day? There is NEVER a situation where they are the right solution.

Teams sometimes laugh nervously amongst themselves that their disaster recovery plans are an entirely unknown quantity because they've never tested them. Down migrations are like hundreds of untested disaster recovery plans, only much much worse.

Putting down migrations in source control, and then deploying them to production, is like enjoying a TV dinner with unexploded ordnance on the coffee table. At some point, someone is going to mistakenly run "down" in prod thinking it was dev/local/etc. At least you've practiced your disaster recovery though, right? Hey how often do your automated backups run and when did you last check them? Go, I'll be here when you get back!

...Oh you're back? Are you okay? You look stressed! Anyway, even if you do successfully run a down migration at some point and it proves useful... now what? Your migration history no longer makes sense. The fact that X, Y, or Z was run in production to change the schema in some particular way and unpick some mess, is lost to history.

I propose you forget all about down migrations. Delete them all from source control right now, I promise you no one will miss them.

What happens if you do make a change you want to rollback in production? Ah yes, you carefully figure out how to roll forward! You write a new "up" migration to clean up your mistake. That's right, you write your drop table or alter column statements in an up migration. You prepare a run sheet, you test, and you go through your normal SDLC for deploying schema changes (You do run migrations via CD right?) Now your changes are in source control. They're part of history, as they should be, because it happened, and all future migrations will safely run on top of that history.

So what do you think? Are you going to delete your down migrations from source control? I'd love to hear about it if you do! Come back and tell us how it went in the comments!

Photo by Avery Nielsen-Webb from Pexels

Event Sourcing On a Complexity Budget

Anthony Manning-Franklin — Wed, 11 Aug 2021 10:48:52 GMT

If you've ever heard Greg Young talk about Event Sourcing (ES), or met one of its enthusiasts, you've probably never gone from "Huh?!" to "Holy S#%T!" faster in your life. And then promptly had an overwhelming sense of dread at the concept of actually implementing an Event Sourced system.

It sounds great! It is great. But how are you supposed to get a team of engineers onboard with this idea? How do you get everyone up to speed? How do you balance that with delivering value to the business fast enough that they don't panic? How are you supposed to implement it without causing a catastrophic business failure? Everyone on the team knows how CRUD works. You've been reading about ES for a week/month/aeon with diminishing returns of increased confidence.

Enter the complexity budget.

Put simply, a complexity budget is constraining yourself to a limited amount of complexity, by assigning various elements of your problem and solution a comparative value of complexity. Then, you budget how much complexity you're willing to introduce to the team over a given period, such as quarterly.

Let's say we wanted to introduce Event Sourcing using Greg Young's EventStoreDB, and build read models or projections in Postgres. That's going to require:

Deploying and managing our own database instance (No managed database service)
Learning a new API to interact with that database
Learning Event Sourcing
Learning the ES database model to build and manage event stores
Learning to write ES code
Figuring out how to deploy and manage read model populators
Figuring out how to implement observability and monitoring of read model populators
Figuring out how to deploy new versions of read models and their populators alongside new ones
Figuring out how to version events

That's a lot for a team to learn, especially if you don't already have an event sourcing expert.

If I had to get a team to grok all of that before trying a new approach, while delivering business value, it would take months even with a well developed L&D program, if not longer.

But what if we broke it down? Can we apply some lean principles to this and create some achievable learning loops?

Let's say our team has a learning velocity of 15 complexity points per quarter. This is "stick a finger in the air and guess" stuff, since we don't measure learning velocity like we measure sprint velocity. You need to rely on an intuitive sense of what's achievable for your team. I like 15 points because if 13 points is one really complex topic, it means we can teach our team one very complex thing each quarter, or a handful of small and medium complexity topics.

We want the 15 points to signify the most our team can learn to use at a production grade level each month. Remember, our craft is learning. We have incoming learnings from product, from new technologies, from new approaches, from keeping up with our colleagues. Every engineer has a learning velocity. Your team's learning velocity is that of the engineer with the lowest learning velocity. If it isn't, you're not a team.

How do we assign complexity values to the list above? Let's start by setting some examples with arbitrary relative values using reference points many people could be familiar with.

Learning SASS: 1 complexity point
Learning Reactive Extensions for Redux: 8 complexity points

Now we'll assign relative values to the tasks above:

Task	Complexity Score
Deploying and managing our own database instance (No managed database service)	8
Learning a new API to interact with that database	3
Learning Event Sourcing concepts	8
Learning the ES database model to build and manage event stores	3
Learning to write ES code	3
Figuring out how to deploy and manage read model populators	8
Figuring out how to implement observability and monitoring of read model populators	3
Figuring out how to deploy new versions of read models and their populators alongside new ones	3
Figuring out how to version events	2

That's a total of 41 complexity points. How can we fit this into a 15 point budget?

Well what if instead of deploying EventStoreDB, we use an append only Postgres table? That takes 14 points off the total.

Another 14 points of complexity is in regards to read model populators. There's a way to remove that complexity too, which I will explain shortly.

That just leaves us with:

Task	Complexity Score
Learning Event Sourcing concepts	8
Learning to write ES code	3
Figuring out how to version events	2

These are the foundational components of event sourcing and a total of 13 points!

How do we remove read model populators from the equation? There's actually a perfectly good pattern for building state derived from a series of events that many developers are familiar with: Redux.

Stay with me for a minute, this is all it requires:

An event store in an append only database table
Postgres notification topics and subscriptions
Replaying events on deployment to rebuild state
Replace Redux style actions with Event Sourcing events

In exchange you get:

The ability to completely rebuild your data model on deploy as business requirements change
No need for database migrations
Happy data analysts as your events are now the record of truth about significant business events
Happy data analysts because software engineers are now considering event data upfront as it is operationally significant and required to ship features
You keep writing and deploying applications largely as usual, High Availability, scaling, observability, all operate as usual

So what about the complexities of a system like the one above? What about snapshots? Ensuring correct ordering of events? Setting up the database table, saving events, and just generally hooking up a Redux implementation to a web server and streaming events?

Well luckily most of that has been encapsulated in a package I've created called es-reduxed.

It ensures that:

Events are dispatched once from a single instance per request
Events are received by all application instances
Events are processed in order
Events are replayed on deployment
The application does not start serving web requests until it has caught up

All consumers of the package need to do is:

Write reducers
Ensure GET requests read from the Redux state
Ensure POST requests dispatch events

However if you want to maintain a RESTful API that returns the updated resource, we've even included the ability for the raiseEvent function to asynchronously wait until the new event has been replayed through the reducer, and return the updated state via a Promise!

I've built an example application using this approach, you can:

Play with it on this demo app
Check out the code here on GitHub

In the long term

This isn't a permanent solution for an event sourcing system. It's a stepping stone to reduce the pain of getting started and build production-level experience on your team.

Those 14 complexity points for read model populators? Come back to it after you've got the system running. Once you can see what the usage patterns are, you can start thinking about what projections could be built as database tables in the 3rd normal form by read model populators.

That might even fit nicely within your complexity budget for the next quarter.

Caveats

The first caveat I have is that I'm yet to take this approach on a very large production application. However, that's also not the ideal use case. The best use case for this approach is a green fields system -- something where you can afford to make some mistakes, in production, while the team learns. As long as the events in your event store have all the data you need, there's nothing tying you to this approach in the long or even medium term. That said, my team is currently trying this approach in a production system at the moment. My experience so far has been positive! Even with my attempts to break it.

Also, this approach clearly won't work in some scenarios. If your application state is too large to fit in memory, you will need to persist some or all of the projection to a database or cache. That might take Redux off the table and reintroduces lots of complexity.

If you can slice a small domain and build confidence that way, maybe you can use it in a limited scope. Just remember that you probably want to start with a single event stream for a domain, and an event stream aligned with your transactional boundary. Otherwise you're going to introduce distributed transactions, compensating events, and complexity will quickly snowball again.

Conclusion

So do I recommend building a production system this way tomorrow? No. Build a couple of practice applications first. Get a feel for it.

Let me know how you went in the comments!

Photo by Fiona Art from Pexels

Solve Recursive Problems Faster With This Approach

Anthony Manning-Franklin — Sun, 08 Aug 2021 14:57:53 GMT

If you write functional code, you will encounter problems where a recursive algorithm is an appropriate solution. However, if you've come from an Object Oriented background or you are new to functional programming, you might struggle when it comes time to write your recursive function. I'm going to give you a simple framework for solving these recursive problems.

Understand Your Types

In a recursive function, you always have some input value, some form of work, and some output value. How you return that output might vary, i.e. if you're appending to or mutating some collection, or returning a primitive value from nested calls. Since the latter is easier to type and reason about, let's start there.

Starting with a simple factorial function, the problem domain is maths, so the input and return types will be numbers.

type CalculateFactorial = (factor: number) => number;

Now that I know my function needs to take a number and return a number, lets implement it:

const calculateFactorial: CalculateFactorial = (factor) => {  if (factor === 0) return 1; // !0 returns 1 because maths  return factor * calculateFactorial(factor - 1);}

Yes, this does not have tail call optimisation nor is it in a recursive trampoline, but we're building our mental model of recursion step by step. Hold on, we'll get there.

Let's try a problem wherein we want to traverse some tree structure in our function. In this case, I have a set of nested JSON objects.

type Leaf = {  nodeAddress: string;  data: string;}type Branch = {  nodeAddress: string;  children: Array;}

Well just looking at my types I can already see a base case and recursive type structures!

Lets have a crack at writing this type of recursive function:

const traverseTree = (nodeAddress: string): Branch | Leaf => {  const node = getNodeByAddress(nodeAddress);  const isLeaf = checkIsLeaf(node);  if (isLeaf) {    return {      nodeAddress,      data: getNodeData(node),    };  }  return {    nodeAddress,    children: getNodeChildrenAddresses(node).map(traverseTree)  };};

So here we can see the function follows the type if it's a leaf node, we get the data, and if it's a branch node, we get the addresses of its child nodes and map over them with our traversal function. By taking time to consider our types, we've reduced our cognitive load while writing the function.

Check your base case

In every recursive function we have to have a base case. This is the scenario where recursion stops because the function no longer calls itself.

In the first function it was if (factor === 0) return 1;In the second function it was

  if (isLeaf) {    return {      nodeAddress,      data: getNodeData(node),    };  }

You may have been told in the past that you should start by defining your base case in a recursive function, but as we saw above, it can be useful to start by defining your types first. You can't define a base case if you haven't grokked the data structure you're working with.

We saw in the first function that we were taking in and returning numbers, so the base case must be some sentinel number. In the case of factorial, that sentinel value is 0.

In the second function we saw that we had a recursive data structure, so our base case had to be the leaf nodes. This also informed us that our function returned two different things a Leaf or a Branch.

Define the business logic

A recursive function always involves some work, some business logic. In the simplest case it is a single expression, such as in the factorial solution, where the work is the multiplication operator. In a more complex case, the work may involve mapping, transforming, calculating, asynchronous calls, and more.

Try to be clear in your mind about which parts of the function contain the business logic, and which parts contain the recursion logic. Whenever possible, separate them in code through spacing and ordering.

Optimise

The above implementations have some limitations. They will create a new stack frame for every call, causing them to crash in some cases, in addition to incurring a higher memory cost. In some cases we can use Tail Call Optimisation. This occurs when the recursive function call is in the return position, e.g.

function factorial(factor: number, product = 1): number {  if (factor === 0) return product;  return factorial(factor - 1, product * factor);}

In this version of factorial, we pass the result of the work into the next function call, which is in the return position. To make it even clearer:

function factorial(factor: number, product = 1): number {  if (factor === 0) return product; // Recursion logic  const newFactor = factor - 1; // Work AKA business logic  const newProduct = product * factor; // Work   return factorial(newFactor, newProduct); // Recursion logic}

Above we can clearly see that when we call factorial at the end, the calling stack frame is not needed anymore. We've finished with all the variables inside this stack.

If we tried to arrange the original function this way:

const calculateFactorial: CalculateFactorial = (factor) => {  if (factor === 0) return 1; // Recursive logic  const newFactor = factor - 1; // Work  return factor * calculateFactorial(newFactor); // Work & Recursion logic}

You can see we have a dangling variable factor in the stack frame awaiting the result of the call to calculateFactorial(newFactor) before we can evaluate the multiplication expression and return. This means we have to hold onto this stack frame until the recursion completes!

However, since Nodejs 8 we cannot use Tail Call Optimisation, so what can we do instead?

Recursive Trampoline

A trampoline is the function we use to essentially manage our own stack frames using closures. Here's a very generic trampoline we could use with any function. We're about to use it to "trampoline" the factorial function:

const createTrampoline =      (fn: any) =>      (args: any[]) => {        let result = fn(args);        while (typeof result === 'function') {          result = result();        }        return result;};

This function will take a function as its first and only parameter. It then returns a new function that will call the function we passed it before, and in this case passing through the parameters we supply it. After calling our supplied function (fn), it then checks if the result of our call was a function, if it is, it continues calling it in a loop until it gets a result that is not a function.

Now we have to change our implementation of factorial, just slightly, for this to work:

function factorial(factor: number, product = 1): number | (() => ReturnType<typeof factorial>) {      if (factor === 0) return product;      return () => factorial(factor - 1, factor * product);}

All we have changed in the function body is that the continue case now creates an anonymous closure function that, when called, will call factorial again with the new values. It now returns this closure function instead of returning a value.

Two important things to notice, based on what we've learned so far:

The return type now includes a recursive type! It can return number or a function calling and returning itself.
The returning closure uses tail call optimisation!

So you can see we create a new function that includes our new factor and new product in the function closure, but each instance of factorial is removed from memory before the next call is executed in our trampoline. Neat eh?

In practice, this winds up looking like trampoline(factorial)(5) where we immediately invoke the function returned by trampoline.

Example Test Suite

Here's the test suite I used for this article:

import { expect } from 'chai';describe('recursive factorial', () => {  it('factorial without TCO', () => {    function factorial(n: number): number {      if (n === 0) return 1;      return n * factorial(n - 1);    }    expect(factorial(5)).to.equal(120);    expect(factorial(100)).to.equal(9.33262154439441e157);    expect(() => factorial(200000)).to.throw(RangeError);  });  it('factorial with tail call optimisation', () => {    function factorial(n: number, product = 1): number {      if (n === 0) return product;      return factorial(n - 1, product * n);    }    expect(factorial(5)).to.equal(120);    expect(factorial(100)).to.equal(9.332621544394418e157);    expect(() => factorial(200000)).to.throw(RangeError);    // Node has dropped tail call optimisation since node 8    // This may work in some browsers,  });  it('factorial with a recursive trampoline', () => {    const trampoline =      (fn: any) =>      (n: number): number => {        let result = fn(n);        while (typeof result === 'function') {          result = result();        }        return result;      };    function factorial(      n: number,      product = 1    ): number | (() => ReturnType<typeof factorial>) {      if (n < 1) return product;      return () => factorial(n - 1, n * product);    }    expect(trampoline(factorial)(5)).to.equal(120);    expect(trampoline(factorial)(-1)).to.equal(1);    expect(trampoline(factorial)(100)).to.equal(9.332621544394418e157);    expect(trampoline(factorial)(200000)).to.equal(Infinity);  });});

Why don't you try to write your own implementation of a recursive summation function?

Here's the same three test cases, but of course I've changed out the expects and removed the function implementations.

import { expect } from 'chai';describe('recursive sum', () => {  it('sum without TCO', () => {    function sum(numbers: number[]): number {      // Write me!    }    expect(sum([5, 3, 11, 13])).to.equal(32);    expect(sum([100, 230])).to.equal(330);    expect(() => sum(Array.from({ length: 200000 }, (_, i) => i))).to.throw(      RangeError    );  });  it('sum with tail call optimisation', () => {    function sum(numbers: number[], summation = 0): number {      // Write me!    }    expect(sum([5, 3, 11, 13])).to.equal(32);    expect(sum([100, 230])).to.equal(330);    expect(() => sum(Array.from({ length: 200000 }, (_, i) => i))).to.throw(      RangeError    );    // Node has dropped tail call optimisation since node 8    // This may not throw in some browsers,  });  it('sum with a recursive trampoline', () => {    const trampoline =      (fn: any) => (/* add params and types */) => {}; // Write out the implementation of trampoline yourself, it will help you grok it    function sum(      numbers: number[],      summation = 0    ): number | (() => ReturnType<typeof sum>) {      // Write me!    }    expect(trampoline(sum)([5, 3, 11, 13])).to.equal(32);    expect(trampoline(sum)([100, 230])).to.equal(330);    expect(      trampoline(sum)(Array.from({ length: 200000 }, (_, i) => i))    ).to.equal(19999900000);  }).timeout(10000); // This may take a while if you use `shift`});

See if you can get the tests to pass, and let me know how you went in the comments!

Your Team Won’t Thrive if it Cannot Teach

Anthony Manning-Franklin — Sun, 25 Jul 2021 14:24:02 GMT

Teaching others to teach is the single most important thing a leader can do.

In business, the organisation that learns the fastest, wins. In order to learn fast, we dont just conduct experiments on customers. We have to disseminate knowledge amongst our peers. We have to integrate new data and new models to form new understanding, enabling us to produce new insights. We have to teach our peers the norms, processes, expertise and expectations of both our culture and our craft.

In this sense, the more senior your position, the more crucial it is that you teach others how to teach. You must explain, support, and most importantly, demonstrate teaching behaviour.

Since the DevOps revolution more companies pay lip service to a learning culture. They pay for a book stipend, maybe pay for some course materials or certifications, maybe even pay for people to attend conferences. But the successful companies dont create a learning culture at all. They create a teaching culture.

The best learning we can do is among peers. Thats when the books weve read become real. Thats when ideas become actions, and thats when individuals synthesise new culture and new institutional knowledge. When they form groups in learning & teaching loops.

I think the greatest thing I ever learned as a manager was when I was taught to execute on a learning and development program. And I think the single greatest thing I implemented as a manager was a weekly learning and development sync up that included mob learning amongst peers.

Engineers would fire up their attempts at various coding exercises, and clamour around the monitor is groups. Each debating the merits and alternatives to the code on the screen. Everyone in those sessions acted as both the mentor and the student. Quickly, the group self organised and ran itself.

Learning isnt passive. Its active. It happens when we teach and it happens when we attempt to recall and utilise what weve studied.

So how do we create a teaching culture? How do we teach others to teach? Honestly I dont know for certain, Im not an expert, yet. Heres what Ive found so far:

Start with practices

Start with a consistent practice. Culture starts with habits, routines, or practices amongst groups. With repetition, those practices form shared ideas, experiences, and mental frames for interpreting events amongst groups. These are the intangible artefacts of culture. The phrase fake it til you make it is apt here. Once a practice becomes culture, its self perpetuating and may out live your tenure.

Create a vision

Reframe your learning and development program as a teaching and development program. Set the expectation that all were going on a journey of learning to teach. Explain why this is important by creating a vision of the culture in three years time. Make it visceral, make it real.

Create opportunities for teaching

Establish regular skill shares. Its important that these address two very different contexts:

Your close team, where people have shared context and are likely to more easily understand you
Another team, where you will have to learn to reframe your message and unpack the underlying assumptions that made your talk relatable to the first group

It helps if individuals present their skill shares in this particular sequence. It enables people to formalise their ideas in layers, similar to how its structured in our brain. First we integrate knowledge by linking it to what we already know. Then we build understanding by reframing it piecemeal into its own set of layers and ideas.

Take care that company wide skill shares dont become an interdepartmental propaganda machine.

Create opportunities for mob learning

Create challenges specific to your craft. Im a software engineer, so in my world thats coding challenge to write an application that passes a suite of automated tests. The important thing here isnt only the individuals repeatedly practicing and improving their craft against the challenge, its everyone coming together to teach each other what they learned.

Support diverse teaching styles

Not everyone can teach in the same way. I can write an article, and I can sit down with someone one on one and guide them through new learning, especially in a peer programming context. I cannot, for the life of me, present a well structured talk. Which is funny because I can do a killer pitch for a startup, no sweat. But give a presentation on a topic I'm knowledgable about? Painful for everyone involved.

If I can give an unstructured talk, however? Brilliant. Get me talking about a topic I'm enamoured with in front of 4 or 5 peers and I am off! Full info dump.

Perhaps you have a lot of writers on your team? The approach then may be to create a writers guild sharing and revising each others drafts before publishing it on the company blog.

Create a practice of considering others

Set teaching objectives for your team members, as part of their personalised Learning/Development Plan.

That is, sit down with them and have them pick a more junior team member to coach towards some goal. It might be that they can work independently with a particular technology, or achieve a particular promotion within a certain timeframe. Then get them to work backwards, ask them what this junior team member will know when they've achieved this goal. What foundational skills or knowledge do they need to be able to do those things? What related or complimentary skills are involved? e.g. If I know how Event Sourcing works, that means I'll also need to know about concurrency, locking, read model populators, transactions & transaction boundaries, eventual consistency & CAP Theorem, and so on. Have them work backwards in reverse chronological order, and they will have created a framework for teaching their peers.

This of course requires a lot of empathy, so you may have to help them imagine themselves as their peer, with their knowledge and experiences. Teaching others requires sophisticated modelling of other minds, which happens to greatly enhance intra-team communication.

The goal here is that once youve created a teaching culture, it becomes part of the cultural norm. Its no longer cognitively taxing for team members to teach each other because theyve had plenty of regular practice. Now, when an incident occurs and you need to run a Post Incident Review (PIR), the team has supportive, coaching, teaching patterns to fall back on. A blameless PIR is more likely to bear unspoilt fruits if the people in it are prioritising learning & teaching. Or say the team introduces a new technology or approach, now they have a constructive pattern to fall into for disseminating that knowledge. With time, practice will become culture, and just the way your team gets things get done.

If you have any experience, ideas, or suggestions, please share them in the comments! Teaching others to teach seems to be the core management skill that is talked about least.

Runtime typing with http-schemas

Anthony Manning-Franklin — Wed, 02 Jun 2021 06:37:33 GMT

http-schemas is a library for defining an API schema. It comes with utilities to help you use and enforce this schema in both your client and server codebases.

What's wrong with the way we write web apps normally?

API boundary is poorly defined in code: Adding duplicate type definitions in client and server creates an opportunity to introduce bugs with every change, as these definitions can fall out of sync. It often relies on casting with as more than strict enforcement, which creates an opportunity for runtime errors. For example, axios may accept a generic to define the response payload type, but it doesn't enforce that the received data actually conforms to the type. In this case, Axios gives you a footgun to create runtime bugs with.
Often no strict validation of the boundary: i.e. if i send the wrong payload, such as a string instead of a number, or extra properties, or missing properties, will your app error? with a 500 or a 400? Or worse, carry on incorrectly? Will it give meaningful feedback?
Expensive to write custom validation: How much discipline does it take to write extensive validation for every endpoint? How long do you spend on it? Sure it's valuable, but it isn't a high leverage way to help the business deliver on its core value proposition, not compared to writing business logic.
Poor developer experience: How often have you read code to find out what the payload for a request or response should be? Even checking a swagger doc is less than ideal when coding against/for your own API. Especially since a swagger doc cannot guarantee correctness against the actual application.
Types are contagious: A poorly typed API boundary causes a proliferation of unknown or any types throughout your code base.
Less resilient application code: Good types help illuminate error cases, edge cases, and flaws in our implementation of business logic. A lack of these types and a proliferation of any types leads to code that is often missing code branches and thus buggy.

How does http-schemas help?

Reduce duplicate type definitions: Define a well typed API schema once for your entire application:.
Improved developer experience: Use this API schema across client and server with fantastic type hinting and checking. Now a lot of runtime errors are instead raised at compile time, saving developers and testers time.
Positive contagion: Use the types from your API schema at every level of your codebase -- from route handlers, to domain logic, to your persistence layer. Now these types are a positive contagion proliferating through your code base.
Reduce the temptation to use any and unknown: The broken windows theory from clean code applies here. If your code has lots of well defined types, any new code submitted containing any is going to stand out, helping you maintain code quality.
Ease refactoring: Update a type at the API boundary and you are forced to update code in both the client and server, less you get a compile time warning.
Validate the API boundary at runtime: http-schemas validates your request payloads at runtime. It checks that they match the types specified in the API schema and will return a 400 when it receives a bad payload. This has a huge impact on the reliability of your application, as it is now much less likely to attempt to continue operation when it receives unexpected data, which could lead to bad application state or even corrupt data in your data store.
Improve application observability: Measure and report on status codes to see errors such as a bad client release show up as an increase in 400s. This will improve your time to detection, mean time to recovery, and change failure rate as it shines a light on any accidental breaking changes to your API schema.

For example, we once released a breaking change to a trivial API in our application. We knew it would impact customers only very mildly, and when we did release it, we immediately saw a huge spike in error rates as users with the old client were hitting the new endpoint. This spike asymptotically returned to a nominal level as users refreshed their browsers. If this had been an unexpected change to a critical endpoint, the resulting PagerDuty alarms would have allowed us to respond very quickly.

If it had been a vanilla express application, we might never have noticed.

Creating an application with http-schemas

I built a simple web app using http-schemas to demonstrate its value. I've attempted to avoid an overly contrived example, so I built an application that allows users to create polls where other users can vote on and contribute to choices. I only allowed myself one small concession -- to save time and simplify re-use by others, it uses an in-memory data store for its persistence layer!

View the app: https://http-schemas-demo.antman-does-software.com
View the code: https://github.com/Antman261/http-schemas-webserver-demo

The folder structure

The easiest way to set this up is to structure your application as a pseudo-monorepo. We do this by creating three main folders inside a thin top-level package.

client
server
shared/http

Installing the API Schema in client and server

The crucial step is to install the package defined in shared/http as a package inside client and server. NPM is able to install packages from the filesystem using symlinks. Provide the path to the package as an argument to npm i as follows: npm i ../shared/http

NPM installs the package in your node modules via a symlink and adds an entry to your package.json as follows:

  "dependencies": {    "compression": "^1.7.4",    "cors": "^2.8.5",    "express": "^4.17.1",    "helmet": "^4.6.0",    "api-schema": "file:../shared/http", // <= this bit here    "http-schemas": "^0.9.1",    "morgan": "^1.10.0"  }

We've set up shared/http as a typical package with a src directory containing TypeScript and a lib directory containing compiled javascript with generated type declarations and type maps.

  "name": "api-schema",  "version": "1.0.0",  "description": "",  "main": "lib/index",  "types": "lib/index",

The shared API schema

Inside shared/http/src we have three main files that define our API Schema:

types.ts
schema.ts
index.ts

Types

All of the domain types for our API are declared in shared/http/src/types.ts. The domain model here is relatively simple: Polls contain choices, and choices have a count of votes. However, we model this domain with two types, the domain type and the input type.

Lets consider what the ideal domain type would be for a Poll:

export const Poll = t.object({  text: t.string,  type: PollTypes,  choices: t.array(Choice),  id: t.number,});

This looks pretty good, but lets consider the ideal POST request to create this object:

POST /api/polls HTTP/1.1{  "text": "Is this poll good?",  "type": "OPEN",  "choices": [    "Yes", "No", "Maybe"  ]}

We don't yet have an ID, nor do we need choiceId or a count of votes for the choices array all we need is a string of the text for the choice.

Instead we create a simplified input type:

export const PollInput = t.object({  text: t.string,  type: PollTypes,  choices: t.array(ChoiceInput),});

What about consistency between input types and domain types? We use the spread operator to reuse the relevant types in the domain type and override or add the delta:

export const Poll = t.object({  ...PollInput.properties,  choices: t.array(Choice),  id: t.number,});

Schema

We declare the API schema in a single declarative file in shared/http/src/schema.ts

import {createHttpRoute, createHttpSchema, t} from "http-schemas";import {ChoiceInput, Poll, PollInput} from "./types";export const ErrorBody = t.object({error: t.string})export const pollsApiSchema = createHttpSchema([  createHttpRoute({    method: 'GET',    path: '/polls',    responseBody: t.object({      polls: t.array(Poll),    })  }),  createHttpRoute({    method: 'GET',    path: '/polls/:id',    paramNames: ['id'],    responseBody: t.union(Poll, ErrorBody),  }),  createHttpRoute({    method: 'POST',    path: '/polls',    requestBody: PollInput,    responseBody: Poll  }),  createHttpRoute({    method: 'POST',    path: '/polls/:id/choices',    paramNames: ['id'],    requestBody: t.object({text: ChoiceInput}),    responseBody: t.union(Poll, ErrorBody),  }),  createHttpRoute({    method: 'POST',    path: '/polls/:id/choices/:choiceId/vote',    paramNames: ['id', 'choiceId'],    responseBody: t.union(Poll, ErrorBody),  })]);

This defines the entire API schema in one place and returns a pollsApiSchema object we have exported for use later. In this file we don't declare the types that model our domain simply for code clarity sake. This file merges the domain types with HTTP types such as ErrorBody and declares their usage amongst a collection of routes.

Index

In shared/http/src/index.ts we collect and export only the types and objects we will need in our server and client code:

import { TypeFromTypeInfo } from "http-schemas";import {ErrorBody, pollsApiSchema} from "./schema";import {Poll, PollInput, ChoiceInput, Choice} from "./types";export type Poll = TypeFromTypeInfo<typeof Poll>;export type PollInput = TypeFromTypeInfo<typeof PollInput>;export type ChoiceInput = TypeFromTypeInfo<typeof ChoiceInput>;export type Choice = TypeFromTypeInfo<typeof Choice>;export type ErrorBody = TypeFromTypeInfo<typeof ErrorBody>;export {pollsApiSchema};

We use the TypeFromTypeInfo generic to transform our runtime type definitions back into TypeScript type definitions.

Server

For the most part, everything in here is a typical express web server. There are only two things we need to do a little differently:

Create a decorated router for our API using the schema we defined earlier
Create enhanced route handlers for the routes in our API

Creating a decorated router

In this project we create the decorated API router in server/src/index.ts:

const pollsApi = decorateExpressRouter({  schema: pollsApiSchema,  onValidationError: validationErrorHandler,});

We supply this function our schema and optionally a validation error handler, and it gives us back an express router.

We can now mount this router and provide route handlers to the routes:

app.use('/api', pollsApi);pollsApi.get('/polls', getPollsRouteHandler);pollsApi.post('/polls', postPollsRouteHandler);pollsApi.get('/polls/:id', getPollByIdRouteHandler);pollsApi.post('/polls/:id/choices', postChoiceRouteHandler);pollsApi.post('/polls/:id/choices/:choiceId/vote', postVoteRouteHandler);

The type checking on .get and .post will not allow me to provide handlers for routes I have not defined. It will also provide type hints and autocomplete suggestions for the available routes.

Creating enhanced route handlers

We import the route handlers from server/src/routes/polls.ts where we define them as follows:

export const getPollsRouteHandler = createRequestHandler(  pollsApiSchema,  'GET',  '/polls',  async (req, res) => {    res.json({ polls: await pollsRepo.getPolls() });  });

We use the createRequestHandler to return a well typed route handler. It also strictly enforces the types we provide in the handling function for the last argument, based on the previous three.

The enhanced request handler provides the following benefits:

It does not compile when we miss any properties in the object provided to res.json.
It removes any additional properties provided to res.json. This mitigates against leaking sensitive information.
It validates the incoming request payload, returning a 400 error if it does not comply with the schema for that route and HTTP verb combination.

Types flow through to the domain layer

We can import the types from our API schema for use throughout our code base, such as in domain code:

import {Poll} from 'api-schema';type ValidateChoiceOutcome = 'VALID' | 'INVALID_POLL_TYPE' | 'DUPLICATE_CHOICE';export const validateChoiceForPoll = (choiceText: string, poll: Poll): ValidateChoiceOutcome => {  if (poll.type === 'OPEN') {    if (poll.choices.map(c => c.text).includes(choiceText)) {      return 'DUPLICATE_CHOICE';    }    return 'VALID';  }  return 'INVALID_POLL_TYPE';}

Now we can be certain our domain implementation is in sync with our API definition.

Types flow through to the persistence layer

We can also use the API types down in our persistence layer code:

import { Choice, ChoiceInput, Poll, PollInput } from 'api-schema';export type PollsRepo = {  getPolls: () => Promise;  getPollById: (id: number) => Promiseundefined>;  getChoicesByPollId: (pollId: number) => Promise;  createPoll: (poll: PollInput) => Promise;  createChoiceForPoll: (choice: ChoiceInput, pollId: number) => Promise;  addVoteForChoice: (pollId: number, choiceId: number) => Promise;};

When using an SQL backed persistence layer we may even provide mapping functions to and from our API types and our database IO types.

Client

Similar to the server, we can utilise our api-schema types throughout the client code. Additionally, we can use a client api package provided by http-schemas to:

Add type checking to our path parameters and json request bodies.
Add type information to the response data returned from our API.

Creating an apiClient

We initialise this api client in src/apiClient.ts:

import {createHttpClient} from "http-schemas/client";import {pollsApiSchema} from "api-schema";const baseURL = 'http://localhost:8080/api';export const apiClient = createHttpClient(pollsApiSchema, { baseURL });

Now we can use it in hooks, such as this one that drives most of the client-side logic:

export const usePolls = (apiClient: HttpClient<typeof pollsApiSchema>): UsePollsReturn => {  const [status, setStatus] = useState('READY');  const [polls, setPolls ] = useState([]);  const refreshPolls = async () => {    if (status === 'LOADING') {      return;    }    setStatus('LOADING');    const result = await apiClient.get('/polls'); // <= Result is { polls: Poll[] }    setPolls(result.polls);    setStatus('READY');  }  // REMAINDER TRUNCATED FOR BREVITY

Types flow through to logical components

Using our API types in our logical components allows us to ensure that our client behaves predictably when interacting with the API, for example:

  const addPollSubmitHandler = async (pollInput: PollInput): Promise<void> => {    const result = await apiClient.post('/polls', {body: { pollInput }});    addPoll(result); // Result here is type `Poll`  }

Types flow through to presentational components

Using these types in our presentational components keeps our presentation in sync with our API code, allowing us to ensure our data requirements are met for our presentation. For example here's a snippet of the Poll.tsx component:

import * as React from "react";import {Button, Card, CardBody, CardHeader, Table} from "reactstrap";import {Poll} from "api-schema";import {ChoiceRow} from "./ChoiceRow";import {useState} from "react";import {ChoiceForm} from "./ChoiceForm";type Props = Poll & {  createChoiceVoteClickHandler: (choiceId: number) => () => void;  onSubmit: (text: string) => void;};export const PollCard = ({id, text, type, choices, createChoiceVoteClickHandler, onSubmit}: Props) => {  const [isFormOpen, setIsFormOpen] = useState<boolean>(false);  return (    'my-4'>              {type === 'OPEN' ? (                  ) : (          'float-right mt-4 text-muted'>This poll is fixed. You cannot add choices.        )}        {          // TRUNCATED FOR BREVITY        }

Conclusion

I hope this helps convey the power of a well typed, validated API. This is a solution unique to TypeScript, and as such, uniquely powerful.

Why Teaching is The Only Way to Write Better Code

Anthony Manning-Franklin — Sat, 22 May 2021 09:59:22 GMT

return !!someVar ? (  someArr.length ? someArr[0].someProperty : somethingElse || 'cheese') : foo ? getFooBar(foo) : setAndGetBar(foo);

This horribly contrived nested ternary statement could be the best way of expressing the solution to your problem (it probably isn't, but let's pretend). Should you use it? Or should you unpack it into several named variables and if statements? You've probably already answered in your head with your own personal opinion, but you actually shouldn't answer yet because you don't have enough information I haven't told you about the team that will be reading it.

Every day we go to work and we write code, and as we're writing code, we're consciously or unconsciously asking ourselves questions about the code we're writing:

Is it the right solution?
Are there any bugs I've missed?
Is this function doing too much?
How will I make sure this is working in production?
Should this wake someone up if it fails?

But a question we often forget is will everyone on my team understand what I've written and why?

Code is read far more than it is written, by an order of magnitude at least. It's imperative that our code is understandable to everyone mid-level and up (we don't expect juniors to understand everything, and we don't expect them to work as independently). If we're writing code that not everyone understands, then we're not team players, we're not helping the business, and while it might feel like we're creating job security if we create systems that only we can understand, we're actually building a lightning rod for our back as we make ourselves the bottleneck.

If people can only contribute to code that we've worked on by asking us questions about it and having us explain it to them, we will eventually find ourselves overwhelmed with queries from others and no time of our own to write code. Or you could ignore your colleagues and continue writing indecipherable code, but that is definitely a career limiting choice. Eventually, you will become the bottleneck to all significant work, and the business will start breathing down your neck to get everything done. You will be stressed, and you might feel like a rockstar, but it is a false high.

When I find a bottleneck engineer in a system (a Brent, I call them, from a character in The Phoenix Project), I politely ask they take a break from the project for a few weeks, stop answering questions about it, and ask for their help elsewhere. Inevitably, other engineers need to take up the slack, and struggle through the code, but eventually, knowledge about the system is shared, the bottleneck mitigated, and the team starts having opinions about how to improve the system so its simpler to work with.

So what do we do when we want to write more advanced code?

One word: Teach.

And that is no easy thing to do! But a rising tide lifts all ships, and believe me, it's much better to be the rising tide than a bottleneck. You won't be the rockstar people need and put up with, you'll be the champion everyone respects and loves working with.

How do we teach? Let's run through some techniques that I've found extremely useful in teaching others and raising the capabilities of the team. You'll likely find that you're naturally stronger at some of these than others, but you'll also find some of these work better for some members of your team than others. It's important to work with your other more senior colleagues to train the team with complimentary teaching styles.

Pair Programming

This is one of the most effective ways I've found to close skills gaps on a team. When pair programming, each of you sit together (or screen share) and view the code together in real time. If you only have one keyboard, then one person can act as the driver while the other acts as the navigator. The driver physically writes the code, while the navigator gives feedback and suggestions. It's important to switch roles frequently, commonly with each pull request.

This might sound like it will take longer, but overall it can be much faster. Pair programming tends to produce less defects, shares knowledge, results in simpler solutions, more maintainable code, and everything pair programmed has effectively been reviewed, meaning pull requests are merged faster rather than sitting in review limbo.

When I'm pairing with someone junior to me, I will often have them drive so that they can learn by doing. This helps them develop some muscle memory for the patterns and solutions, and helps them retain more of what they learn. When I'm driving, I ask a lot of questions, if I suggest an idea and write it out, I ask for their thoughts on it. I ask them what might be wrong with my idea, or "If you would change one thing about this, what would it be?" which is a great question to ask because it forces an answer while shielding them from the anxiety of potentially offending you.

Yes, some people will struggle to offer suggestions to colleagues they see as senior or more knowledgable. Be cognisant that your pairing buddy might be nervous if this is new to them. If so, make some harmless jokes about your own code, compliment them on their suggestions, and ask them questions that show you care about their opinion! Junior developers have beginners mind and can see things anew. At the same time, having to explain things to someone helps you learn too; it deepens and solidifies your knowledge.

Exercises and Kaizen

This is a great way to strengthen new or shaky concepts on a team. You can create coding exercises and challenges for a team that require them to solve problems in new ways. The team works on these challenges during their professional development time, and hones their craft before applying it in production. If you have a pattern or solution in your toolkit that your team is unfamiliar with, this can be a powerful way to bring the team with you so you can use it in production.

It's important that you don't tell people how to do things. We learn best when we're challenged; there is a difference between synthesising new ideas, recollecting knowledge, and recognising knowledge. This is why flash cards and spaced repetition work, because recollection strengthens pathways better than recognition. You have to struggle in order to learn, hence the increasing delay between challenges in spaced repetition. But more powerful than that is making the leap of discovery yourself. When you give your colleagues just enough information to have the "Aha!" moment themselves, they have a deeper and stronger understanding of the concept or solution. They're also more invested in it.

Instead of giving people the solutions to challenges, let them come up with their own solution, then ask lots of questions about their solution, offer some feedback, and only a few suggestions. Gently nudge people towards a better solution and then let them try again. It will take a few cycles, but don't rush it! You want people to make mistakes and get burnt here, they will learn much better, and won't do it in production.

On my teams, I like to run weekly catch up sessions where we all share our progress on these exercises. It's mob programming for professional development, it's highly effective, and you will learn a lot about the teaching and learning styles of your colleagues.

Talks and skill shares

Giving talks and presentations on a topic can be a great way of drumming up interest or awareness in that topic, approach, or pattern. To be clear, it isn't a great way of teaching or getting that knowledge deeply seated in your colleagues such that they can use it in production. It is a great way of introducing new and bigger concepts, and for practicing giving talks! If you want to introduce a new concept to the team, you might give a skill share first, then have the team work on exercises, and then pair with people as they implement it.

Remember, when you're giving a talk, you're giving a high level overview of various concepts. Your slides shouldn't contain too much text, at most three dot points relating to a single idea per slide. Please, don't just read what is written on the slides! Nothing puts people to sleep faster than listening to someone read off a slide deck. Your objective isn't to convey every detail and minutiae in a talk, your objective is to get people to ask you good questions at the end!

It's okay if the slides don't make too much sense on their own. They're only there to act as prompts.

Reading lists

A development team that reads together grows together. It's important you all share the same vocabulary and toolsets of concepts. Having a common set of books that the entire team has read helps with this immensely. If you've all read Domain Driven Design Distilled then you can have a conversation about ubiquitous language, domain boundaries, and domain mappings, and know that everyone is able to join in and engage.

You can always stick the reading list in git and have everyone contribute to it. Getting people to actually read them is the other half of the challenge. A little healthy competition here can be helpful, but be mindful that your colleagues will be at different life stages. A junior with no partner and no family might have a lot more time for reading than someone with kids and a newborn. That's just a fact of life.

If you keep talking about the books you are reading, and taking an interest in the reading habits of others, asking for suggestions and recommendations, you can encourage your team to read the reading list, especially if they've contributed to it themselves. Maybe even add it to your weekly learning exercise catch up.

Asking questions instead of making statements

I've mentioned this briefly throughout this article, but it's such an important concept it deserves its own section. The best way to teach someone anything, is through questions, not statements. It can be very tempting to give the answer, make a statement about the "correct" way to do something, because it makes us feel good.

It's not our job to make ourselves feel good and show off our knowledge, and it isn't how we get promotions either. It's our job to discuss ideas with our colleagues, fill knowledge gaps, and bring people on the journey with us rather than mailing them a postcard from the destination. You do this by asking questions.

Plus you never know, you might get an answer you weren't expecting. You might come up with a new idea or solution, or find out that your idea wasn't so "correct" after all. That's a good thing. Asking questions gives us the opportunity to find out we are wrong. You absolutely want to know when you're wrong as early as possible.

You'll also have better relationships with your colleagues. They will feel more respected, and you will respect them more, because you actually create opportunities for mutual learning. We all learn best in a psychologically safe environment, and questions help foster that sense of safety.

I need to be clear here: These NEED to be honest, open ended questions. If you ask too many leading questions it can come across as condescending. Ask questions with more than a single word answer. Ask questions that need an opinion, or are answered with another question! Don't ask yes or no questions, ask questions that query various elements of quality performance, reliability, robustness. Ask questions that move us to action. Ask "How could we implement observability here?" or "How could we make this more performant?"

Asking questions is an art form that you can spend your life perfecting, and has implications well beyond software engineering that are out of scope for this article. So make a habit of asking questions, you can only get better at it with practice.

Eating the elephant

There's a lot to take in here, and a lot to do. How do we do it all? How can you achieve what might be a big undertaking? The same way you eat an elephant one piece at a time. Introduce one of these concepts at a time. Get good at it, get your team good at it, and then introduce another.

If you go into the office and introduce pair programming, reading lists, programming exercises, and skill shares all at once, you will fail. It might take a week or a month for each of these to stick. You might find your colleagues are resistant to some of it. Figure out what makes sense for your team, and then start experimenting.

Taking responsibility

You might be reading this and thinking "But Anthony, I'm just a junior/mid-level/senior developer, I can't do all this, I don't have the authority." You don't need any. You may need to persuade people, you may need managers to give you time, you may need scrum masters to help you protect professional development time, but leaders don't need authority because they lead from the front.

If you want to be a great developer, you're going to need to develop a leadership style that suits your personality and yes quiet introverts can absolutely be effective leaders, quiet leadership is a real thing. You'll find that as you take responsibility, as you show leadership, people start giving you authority (and promotions). This is a double edged sword, authority is anathema to leadership. If you have any, avoid using it at all costs.

You also don't need to know everything or take responsibility for everything to be a great teacher. You just need to care about other people's learning, knowledge, and success. If you start from there, and build on that fundamental principle, you will succeed in being the rising tide that lifts all ships.

If you have any techniques that helped you share knowledge and expertise, please let us know in the comments!

Start strong with this JavaScript package template

Anthony Manning-Franklin — Thu, 20 May 2021 09:50:33 GMT

Publishing a package is a rite of passage for JavaScript and TypeScript developers these days. It's incredibly easy to do, but hard to do well. It can be daunting, putting our work out there for the world to see, for people to use. But there's more to making a good contribution to the world of open source software than just publishing your package. What about concerns such as testing, typing, licensing, work flows, pull request processes, contributing guidelines, linting, styling, and documentation?

I've created a package template that I believe addresses these concerns and gets you off to a strong start. But rather than just giving you the template and leaving it at that, I'll explain the choices I've made. That way you can make your own decisions and improvements. Plus if you understand it, you can submit a pull request with improvements, so that we can all benefit.

Check out npm-package-template and give it a go. In the rest of this article, I explain exactly what, how, and why.

TypeScript Setup

Yes, this template uses TypeScript. At this point, if you're going to be providing open source packages to the javascript community, you need to provide a well typed library. It's no longer a nice to have, but a must have. Lets take a look at configuring our tsconfig

{  "compilerOptions": {    "target": "es5",    "module": "commonjs",    "declaration": true,    "declarationMap": true,    "outDir": "./lib",    "strict": true,    "esModuleInterop": true,    "skipLibCheck": true,    "forceConsistentCasingInFileNames": true  },  "include": ["src"]}

Here we've started with "target": "es5" in this template because if you're shipping a front end library, that's what you will need to target. However for other use cases a more recent version of ECMAscript may be appropriate.

    "declaration": true,    "declarationMap": true,

These give us our declaration (.d.ts) files and map files automatically in our output when typescript compiles. These are the type files that ship with your package and help users with type hints and doc comments.

"outDir": "./lib",

Here we're simply specifying that we want to write our output to the lib folder at compile time. This folder is where we specify users will import from when they import our package, as specified in our package.json:

  "main": "lib/index",  "types": "lib/index",

    "strict": true,    "esModuleInterop": true,    "skipLibCheck": true,    "forceConsistentCasingInFileNames": true

The remaining checks above are all recommended typescript config options, providing strict type checking, helpers for babel via esModuleInterop, skipping of checks for declaration files, and consistent filename casing which can prevent bugs on different file systems.

Build Process

Building this package is simple and done entirely by the typescript compiler.

Building the project is done using npm run build which executes npx tsc -p .. We use npx to ensure we're using the version of typescript local to this package, rather than a global typescript installation that may be present. The -p flag with . ensures we're using this directory's tsconfig.

Testing

We've setup mocha as our test runner. The configuration for mocha lives in our package.json and is:

  "mocha": {    "extension": [      "ts"    ],    "watch-files": [      "src/**/*.ts",      "test/**/*.ts"    ],    "require": "ts-node/register"  },

This tells mocha to test typescript files, using ts-node, and in watch mode to re-run when changes occur in either src or test directories. Starting mocha in watch mode is done by running npm run watch which calls npx mocha -w. This is the main way you will develop your package, as it allows very effective red/green/refactor cycles following TDD.

Linting & Styling

For linting we've configured ESLint to support typescript and made a few package related choices, the .eslintrc.json is:

{  "extends": "eslint:recommended",  "env": {    "commonjs": true,    "node": true  },  "parserOptions": {    "ecmaVersion": 2020,    "sourceType": "module"  },  "rules": {    "no-console": "off",    "strict": ["error", "global"],    "curly": "warn"  }}

Our prettier config is stock-standard, and in package.json is just

  "prettier": {    "singleQuote": true  },

We've also configured lint-staged to run when we stage files, telling it to run both prettier and eslint in write and fix mode respectively:

  "lint-staged": {    "*.ts": [      "prettier --write",      "eslint --cache --fix"    ]  },

With all of the above, writing professional, consistent code is as easy as falling off a log for you and your contributors.

Development Experience

Contributors can get started as easily as

git clone <path to our repo>cd name>nvm usenpm i --cinpm run watch

nvm use ensures everyone working on this package is using the same version of node, as it picks up the node version specified in the .nvmrc file.

npm i --ci is shorthand for npm install and the --ci flag instructs npm to install dependencies precisely matching the lock file, which is considerably faster.

GitHub Setup

Following the instructions in the npm-package-template readme will result in a Github repo configured with these features:

Protected Branches

We want our main branch to only include code that passes CI and has been reviewed.

Continuous Integration

This package template also includes a basic CircleCI config and instructions on how to get up and running with CircleCI.

Documentation

Documentation can make or break an open source library. People will often read your documentation to decide if they want to use your package. In this template we use TypeDoc and TypeDoc Markdown to generate markdown documentation files from our README.md and the docstrings in our typescript source code!

That means code like this

/** * Use this function to create a greeting string with the given name * @param name The name to greet */export const makeHello = (name?: string): string => `Hello ${name || 'world'}`;

results in markdown like this

## Table of contents### Functions- [makeHello](modules.md#makehello)## Functions### makeHello `Const` **makeHello**(`name?`: *string*): *string*Use this function to create a greeting string with the given name#### Parameters| Name | Type | Description || :------ | :------ | :------ || `name?` | *string* | The name to greet |**Returns:** *string*Defined in: [index.ts:5](https://github.com/Antman261/npm-package-template/blob/f2945d1/src/index.ts#L5)

Which then gets picked up by GitHub Pages, rendered by Jekyll against a template of your choosing, and served up as html! This is how https://tiny-fixtures.com is generated and hosted completely free.

These docs are generated by running npm run build-docs which executes npx typedoc --out docs src/index.ts --disableOutputCheck

--out docs just specifies the output directory as docssrc/index.ts specifies our starting file, everything exported here is documented.--disableOutputCheck flags that we don't mind if there are existing files in the output dir. This is because our GitHub Pages config files live in ./docs too.

GitHub Pages

Full setup for configuring GitHub Pages is outlined in the README. This gives you a free way to host your documentation!

Package.json Config

Remaining package.json config that we haven't talked about all pertains to publishing our package on npm.

  "files": [    "lib",    "LICENSE",    "README.md",    "package.json"  ],

This simply lists the files we want to include in our package tarball sent to npm when published.

"prepublishOnly": "npm run build && npm run build-docs"

This ensures that our changes have been built before publishing! This can prevent embarrassing mishaps such as publishing a new version with no changes because we forgot to build before publishing.

Publishing

As long as we have logged in to npm, and all the template strings filled in as per the README, publishing is just npm publish. Voila!

Conclusion

I hope this guide has made publishing an npm package less daunting, and rendered your journey as an open source contributor a little easier.

Let me know how you went in the comments! And feel free to open a pull request against npm-package-template if you find a better way of doing something.

Easy Node Postgres fixtures with tiny-fixtures

Anthony Manning-Franklin — Tue, 18 May 2021 11:35:55 GMT

Years ago, I used to work almost exclusively with a Python web framework called Django. It heavily borrowed from Ruby on Rails, and one of the things I liked about it was how easily it allowed you to setup test data in a database for your tests. All you had to do was specify some JSON, and it inserted this data into the database before each test case, then cleaned it up afterwards.

You could even generate this data programmatically from data in the database via the Django command line interface tools. What a breeze!

Then I came to TypeScript and found nothing quite so easy to use and reliable for database fixtures. Instead we wrote our own insert and delete functions for our tests, and had to keep track of inserted IDs if we wanted to clean up without nuking our local database. Not only was this a lot of really pointless code to write, it's easy to make mistakes too. Sometimes you might run the test suite multiple times and the database gets into an unexpected state, causing tests to fail because something wasn't cleaned up properly. Yuck!

I couldn't keep working like this and decided to create a library to make database fixtures easy. That library is tiny-fixtures.

How tiny-fixtures works

The premise is pretty simple, you supply tinyFixtures a node-postgres connection pool, and it returns a createFixtures function.

You then use that function to, well, create fixtures! You supply it the name of the table and an array of objects to insert. It gives you back an array containing a setup function, a teardown function, and the array of objects you want to insert with some helper functions attached.

The objects in the array that you supply createFixtures should be plain javascript objects. These represent the rows you want inserted, where the keys are the column names and the values are the values for that column! Easy right?

If you need to insert data in another table, and reference the first table by foreign key, then you call createFixtures again, but this time you reference the foreign key value by calling a lookup function on the associated object from the first createFixtures call.

An example is in order! In this example we are addressing the more complicated use case of inserting to two tables, one of which has a one to many relationship with the other.

  describe('Two table with join use case', () => {    const [setupUserFixtures, teardownUserFixtures, users] = createFixtures('users', [      {        email: 'foo@bar.co',        username: 'tinyAnt'      }, {        email: 'bar@foo.co',        username: 'antTiny',      }    ]);    const [setupUserMessageFixtures, teardownUserMessageFixtures] = createFixtures(      'user_messages',      [        {          user_id: users[0].getRefByKey('id'),          message: 'Foobar did the bar foo good',        },        {          user_id: users[0].getRefByKey('id'),          message: 'I am a meat popsicle',        }      ],    )    beforeEach(async () => {      await setupUserFixtures();      await setupUserMessageFixtures();    });    afterEach(async () => {      await teardownUserMessageFixtures();      await teardownUserFixtures();    });    it('should have a user with two messages, and one with none', async () => {      const users = await getUsers();      expect(users.length).to.equal(2);      expect(users[0].messages.length).to.equal(2);      expect(users[1].messages.length).to.equal(0);    });  });

All we had to do was specify our test data, and then call our setup and teardown functions in the before, beforeEach, after, or afterEach hooks as appropriate. Now the code I'm writing for my integration tests directly adds value to my tests, rather than test specific setup and teardown helper functions or even mocks and stubs.

So user_id: users[0].getRefByKey('id'), might look a bit weird, but that's because when you are setting up your fixtures, the sequence values from the database don't exist yet, and could be different for every test case. Instead getRefByKey takes a string containing the name of a column that you want from the row at runtime, and will look that column up each time it inserts a row.

How did I write this library?

The first step I took when I realised I wanted to use this library was actually writing the docs. I wrote basic documentation for a library I wish I had, and then shared it amongst my peers, asking if they would use it. I received a lot of emphatic yes's, some with heavy sighs and some with a little begging. I took that as a good sign and started writing my library.

Now I have a basic use case covered, I'm waiting to see if it gets traction before I add any further enhancements.

Some improvements I'm considering are:

A command line tool to pull data out of a database and save it as JSON for tiny-fixtures.
The ability to read JSON files as fixtures
Saving a log of inserts to a file so that test runs with errors that prevent cleanup are recovered from seamlessly in subsequent runs.
MySQL and other database support, if requested.

Have a play and let me know what you think of tiny-fixtures in the comments!

The single biggest mistake you could make in TypeScript

Anthony Manning-Franklin — Tue, 18 May 2021 10:20:00 GMT

There is one thing in TypeScript that will undermine all your work and sabotage every benefit you might get from TypeScript. It's a very simple, insidious tsconfig.json value, noImplicitAny: false. Once you have code with implicit anys, it is very hard to fix. I have made this mistake in the past; I know how much it hurts.

Lets quickly take a look at an example

function calculateLoginDestination(user, path) {    if (user.validated) {        return `main/${path}`;    }    return calculateValidationRequestPath(user, path);}

This might look like a harmless little function, do we really need types? Well, in a way this function does have types for both its parameters and its return value. The type is any because no type was specified. What happens if I call the function like this?

const path = req.queryParams.path;const user = await getUserBySessionToken(req.cookies.token);const destinationUrl = calculateLoginDestination(user, path);

Given the context that this function is used, the types for calculateLoginDestination might actually be as follows:

type calculateLoginDestination = (user?: User, path?: string) => string;

Now the problem with the above code should be obvious we dont check the presence of values! If user is undefined or null, the function will throw an error.

But it gets worse! What if I am changing calculateValidationRequestPath and I accidentally change it to return an object instead of a string? Since the return type of calculateLoginDestination is implicitly any, typescript wont warn me. Now destinationUrl is actually string | object! An entire codebase of poorly typed functions quickly becomes a nightmare to modify, whether refactoring or adding features.

Lets see what would happen if we had enabled the noImplicitAny rule from the beginning.

function calculateLoginDestination(user: User, path?: string): string {    if (user.validated) {        return path ? `main/{$path}` : `main/`;    }    return calculateValidationRequestPath(user, path);}

And in the calling context

const path = req.queryParams.path;const user = await getUserBySessionToken(req.cookies.token);if (!user) {    res.status(401).json({ error: Invalid session });    return;}const destinationUrl = calculateLoginDestination(user, path);

Our code now properly guards against missing users, and changes that might affect this code. If I modify getUserBySessionToken such that it returns a result object either containing a user or an error message, TypeScript will no longer compile until I update the above code.

If I modify calculateValidationRequestPath such that it no longer returns a string, typescript will no longer compile until I either fix the function or modify the return type of calculateLoginDestination and its callers.

What if I already have a codebase with implicit any?

Ive been there. You start a project with noImplicitAny turned off, and later regret it. How best to fix it? There is really only one approach.

Turn on noImplicitAny and add explicit any throughout your entire code base!
Have the engineers on your team add types until there are no more anys.

The first step has to be a Big Bang change. It is painful, but necessary. If your code base is so large no one can make a change that large in a single commit, then turn it on file by file. But it is paramount that you get this done in a single day.

The next step is eating the elephant. You have to do it in small chunks, piece by piece until it is done. As you add types, it has a positive contagion effect, where the types propagate throughout your codebase and the amount of typing increases over time. Hopefully you can get the team to swarm on it and add types until the entire codebase is converted in one day. If you cant, at least with the noImplicitAny rule turned on your code wont get much worse in the meantime.

Hopefully you never have to deal with implicit any's, or any's at all for that matter. These days, tsc --init has noImplicitAny turned on by default, so it shouldn't be an issue. And if you ever see a pull request disabling the noImplicitAny rule, do the world a favour and request changes.

Let me know in the comments if you have any implicit any horror stories of your own!

TypeScript's Discriminated Unions With Real Use Cases

Anthony Manning-Franklin — Sun, 16 May 2021 08:07:18 GMT

Discriminated Unions are a powerful TypeScript feature that can lead to some very robust code that scales well on large teams. However it is a rather underrated & under utilised pattern. I'm going to give a real world example for using discriminated unions with a Redux example. You could also apply this pattern to message handlers, mappers, and more.

Reducers

In this example we will build a simple reducer for managing some settings in an application. In this case, the discriminated union helps:

ensure our code addresses every possible action in the system
gives us fantastic type hints inside the branches of our code
prevents us from attempting to access the wrong payload attributes

Let's take a look at the code first, then I will show you examples of modifying this code and how it helps prevent errors.

/*Lets imagine we're creating a settings page. Here the user can:  - toggle sound on/off  - change the appearance (theme)  - enable/disable noitifications  - Set their username */type SettingsState = {  isSoundOn: boolean;  areNotificationsEnabled: boolean;  theme: 'DARK' | 'LIGHT' | 'OCEANIC';  username: string;};type SetSoundAction = {  type: 'SET_SOUND';  payload: {    isSoundOn: boolean;  };};type SetThemeAction = {  type: 'SET_THEME';  payload: {    themeName: SettingsState['theme'];  };};type EnableNotificationAction = {  type: 'ENABLE_NOTIFICATION';};type DisableNotificationAction = {  type: 'DISABLE_NOTIFICATION';};type SetUsernameAction = {  type: 'SET_USERNAME';  payload: {    username: string;  };};type SettingsAction =  | SetSoundAction  | SetThemeAction  | EnableNotificationAction  | DisableNotificationAction  | SetUsernameAction;const initialState: SettingsState = {  isSoundOn: false,  areNotificationsEnabled: false,  theme: 'LIGHT',  username: '',};function settingsReducer(  state = initialState,  action: SettingsAction): SettingsState {  switch (action.type) {    case 'SET_SOUND':      return {        ...state,        isSoundOn: action.payload.isSoundOn,      };    case 'SET_THEME':      return {        ...state,        theme: action.payload.themeName,      };    case 'ENABLE_NOTIFICATION':      return {        ...state,        areNotificationsEnabled: true,      };    case 'DISABLE_NOTIFICATION':      return {        ...state,        areNotificationsEnabled: false,      };    case 'SET_USERNAME':      return {        ...state,        username: action.payload.username,      };    default:      return isNeverAction(action.type, 'settingsReducer');  }}function isNeverAction(action: never, reducer: string): never {  throw new Error(`${reducer} received invalid action ${action}`)}

Let's take a look at what happens if someone adds an action and forgets to write a reducer for it.

I'll add an action like this:

// other actions abovetype SetEmailPreferences = {  type: 'SET_EMAIL_PREFERENCES';  payload: {    marketing: boolean;    transactional: boolean;  };};type SettingsAction =  | SetSoundAction  | SetThemeAction  | EnableNotificationAction  | DisableNotificationAction  | SetUsernameAction  | SetEmailPreferences;

Now if I try to compile I get the following error

src/discriminated_unions.ts:99:28 - error TS2345: Argument of type 'SetEmailPreferences' is not assignable to parameter of type 'never'.99       return isNeverAction(action, 'settingsReducer');                              ~~~~~~

And in my IDE this looks like

Awesome! Let's take a quick look at that type hinting.

TypeScript is able to tell WebStorm that the available properties for action.payload inside this code branch can only be marketing or transactional because we're looking at a discriminated type. An action with the type value SET_EMAIL_PREFERENCES can only have a payload of type { transactional: boolean, marketing: boolean }

Anatomy of a Discriminated Union

The most important aspect of a discriminated union is a property that acts as what is called the "discriminator". This is the property the compiler will use to differentiate between the different possible types. In the above example the discriminator was type, but it could be called anything. Another common name for discriminator properties is kind.

The discriminated union works because the discriminator itself is a narrow type, the union of possible types.

If we were to inspect the value of the discriminator in the above example, it is:

This has happened because we used a literal member to type our discriminator in each of our action types.

type SetEmailPreferences = {  type: 'SET_EMAIL_PREFERENCES'; // This is a literal member  payload: {    marketing: boolean;    transactional: boolean;  };};

In the above example, type is a literal member because it is a value in a type definition. This is the key to implementing discriminated unions.

What about that never?

We used the never type to implement exhaustive checking. This means the code won't compile unless we've addressed every type with a specific case in the switch.

How does this work? Let's break down the isNeverAction function

function isNeverAction(action: never, reducer: string): never {  throw new Error(`${reducer} received invalid action ${action}`)}

Never is a type that won't accept any value, and we've used it for our action parameter. Passing any type to a never will always produce a compile time error. For example

const cantSetMe: never = '';

This won't compile because '' is a string and we're trying to assign it to a never.

Also notice that isNeverAction also returns a never. This is the true return type for a function that can only throw. It doesn't return void because in fact it can never return at all.

In the code, we called the isNeverAction function inside the default case at the very end of the switch statement. However TypeScript knows that the expression in the switch can only be the values in the type SettingsAction['type'], so if you provide a case statement for each one, and return or break in that statement, then it cannot reach the default case.

However, if we do forget a case, then we pass action.type to the never, and TypeScript will complain.

So why do we wrap this up in a function that throws? Well, TypeScript checks only work at compile time. If this were say, a discriminated union based on types from a database or 3rd party API, then we could encounter unpredicted types at runtime. If this does happen, it means the application has corrupt data and is in an unknown state. We should throw an error and stop what we're doing. This will help us find and diagnose errors early.

What exactly is a `never`?

The never type is what we would call a bottom type in Set theory. Huh? So a type is really the set of all possible values. For example, the type string is the set of every possible string ever.

So type Foo = 'foo'; is a member of the set string. It's also a member of the set string | number as is 0, or any number.

It's also true that every type is a member of the set unknown, which is every possible value. We call unknown the top type, because it is a superset of all other types.

Well if never is the bottom type and unknown is the top, does that mean it is the opposite of unknown? Uh, kind of. Unknown is the superset of any type, but never is a subtype of every set.

Yes, that means string contains never, as does number, but you cannot set a never to any value from string because never is an empty set.

never is even a subtype of sets you may make, so 'FOO' | 'BAR' | 'SAUSAGE' includes never, but never of course cannot include the set.

This means that any given set, in union with never, is equivalent to the set. never is the identity element of the union operation.

T | never =>T// more concretely'FOO' | 'BAR' | never => 'FOO' | 'BAR'

You'll recognise that a union with never is very similar to addition with 0, i.e.

number + 0 => number1234 + 0 => 1234

Yes, 0 is the identity element of the addition operation.

TLDR: Check your unions against never to implement exhaustive checking.

Conclusion

Hopefully now you have a new tool up your sleeve to help you write reliable code. Yet again, TypeScript saves us precious time and effort by raising errors in our code at compile time rather than runtime. This pattern gives us a powerful way for TypeScript to yell at us when we make changes and forget something, especially in a large code base with several developers.

And a powerful use-case for the never type! Have you seen any other great usages of discriminated unions or never? Let us know in the comments.

Functional TypeScript in Production Systems

Anthony Manning-Franklin — Sun, 09 May 2021 09:23:17 GMT

Functional programming has enjoyed a massive surge in popularity amongst JavaScript developers in recent years. This is likely partly due to the popularity of libraries like Redux and React which promote functional programming styles, and have proven themselves both viable and in fact extremely valuable.

With TypeScript, functional programming isn't just nice, it's extremely beneficial in production systems for highly scalable business critical applications, both frontend and backend. How do I know? In my current day job, we run a billion dollar plus product on Functional TypeScript, Node, and React, serving tens of millions of requests per day.

What is Functional Programming?

I'll try to keep this section brief, since functional programming has a lot more mindshare these days, let's just make sure we align on what exactly we mean by functional programming in this context before we move on.

Functional Programming works well in JavaScript largely because it was originally pitched as Scheme in the browser. This is why functions are first-class citizens in JavaScript and thus TypeScript.

Functional Programming isn't

It isn't just using functions!

Let's start by listing some things we don't do, or at the very least endeavour to avoid in our functional code.

Object mutations such as result.value = 5;
Variable mutations, so no var or let
while loops
for loops
for in ... loops
Functions with side effects (we can manage or isolate components of our application that induce side effects, but core business logic should be implemented via pure functions)
Functions typed as void or Promise
Array mutations, e.g. pop, push
Pass-by-reference input values for mutation!

So what is functional programming? Let's start with a core tenet.

Pure Functions

Pure functions must always return the same output for a given input. They must not have any side effects. Think of methods on objects often, if I call the same function multiple times, it has a different output, i.e. counter.incrementCounterBy(5) we can imagine would return a different result every time it is called. That's because it is tracking state that exists outside the method, in this case the instance it is a method of. Even worse, methods may produce different results due to calls from other parts of the system. In the above example, if another part of the system called counter.resetCounter(), I might not get back the value I expected. These aren't pure functions at all, and if I simply refactor my code so resetCounter and incrementCounterBy are functions that read and write from some global or even passed in value, I haven't made them functional.

Let's have a look at what a pure function is. Normally, at this point in an article we get the very contrived example of

function sum(x, y) {  return x + y;}

Which, while functional, isn't a terribly inspiring argument for functional programming in serious business systems.

Instead let's see how we might encapsulate some business logic in a pure function. We're going to create a pure function to determine if a given user should be allowed to login, given some stringent requirements.

As such, the first part of this implementation will be establishing some rich types for us to use. Be patient! The functional code is coming.

type User = {  id: number;  email: string;  name: string;  accountStatus: AccountStatus;  verifications: Verification[];};type AccountStatus = 'ACTIVE' | 'PENDING_VERIFICATION' | 'LOCKED';type Verification = {  kind: VerificationKind;  validated: boolean;  payload: EmailVerificationPayload | PhoneVerificationPayload;};type VerificationKind = 'EMAIL' | 'PHONE';type EmailVerificationPayload = {  email: string;  isHighRiskDomain: boolean;  confirmationReceived: boolean;};type PhoneVerificationPayload = {  phoneNumber: string;  confirmationReceived: boolean;};type LoginAttempted = {  user: User;  attemptNumber: number;};type LoginAttemptResult = {  result: 'LOGIN_SUCCEEDED' | 'LOGIN_REJECTED';  reason?: string;}// okay, now we can write our checking functionconst checkUserCanLogin = ({ user, attemptNumber }: LoginAttempt): LoginAttemptResult => {  if (attemptNumber > 3) {    return {      result: 'LOGIN_REJECTED',      reason: 'Too many login attempts',    };  }  switch (user.accountStatus) {    case 'ACTIVE':      return { result: 'LOGIN_SUCCEEDED' };    case 'LOCKED':      return {        result: 'LOGIN_REJECTED',        reason: 'Account has been locked',      };    case 'PENDING_VERIFICATIONS':      const pendingVerifications = user.verifications.filter(v => !v.validated);      if (pendingVerifications.find(v => v.kind === 'EMAIL')) {        return {          result: 'LOGIN_REJECTED',          reason: 'Email verification pending',        };      }      if (pendingVerifications.find(v => v.kind === 'PHONE')) {        return {          result: 'LOGIN_REJECTED',          reason: 'Phone verification pending',        };      }      throw new Error(`Corrupt verification data for user ${user.id}`);      // This is an exceptional case, we want the application to blow up, and completely halt. We should never use errors to branch behaviour on in typescript - errors are untyped and poorly supported in TypeScript. This is very different from Python, where branching on Exception types is encouraged.    default:      isNever(`user ${user.id} had invalid status ${user.accountStatus}`);      // This should never happen unless corrupt data somehow enters our system      // Typescript will not let code compile that it sees can reach the default case  }}

There's some code we could see in a production system! It's complex, it enforces a lot of business rules, for example email verification comes before phone verification perhaps phone verifications cost more money than email verifications, so the business wants this rule enforced in code.

Imagine how easy the test is to setup and run for this code! We can now easily write tests for each case, without worrying about issues such as setting up and tearing down database fixtures or stubbing out APIs. We're not making calls to third parties here, there are no promises, there is nothing but data and logic. Writing tests for every scenario becomes somewhat trivial, which means your tests can easily document all expected behaviour.

This is where pure functions in production systems make sense. But how DO I talk to a database or external API in a functional programming context?

Partially Applied Controllers

This is a pattern I've personally dubbed partially applied controllers. Please, if you know of a better name for this pattern, let me know in the comments!

So, we want to talk to some external, non-deterministic systems (i.e. multiple calls with the same parameters ARE NOT guaranteed to return the same result), get back a result, pass that data to our pure function, flush the result to our persistence system (database) and return the result to the caller so they can operate based on that result.

Now, we want this logic to live inside a function that doesn't have any implicit dependencies. Everything it needs we should be able to pass to it. We also want to be able to write some tests to ensure the controller works, but we don't need to write integration tests to cover every conceivable business case, because thankfully those are covered by the unit tests on the checkUserCanLogin function.

First of all, we will model our persistence store and a third party API each as async data repositories. We're going to supply this repository to a function that returns another function that we can then use to query a given userId and find out if they can login.

Let's get started

type UserRepository = {  getUser: (userId: number): Promise;};type LoginAttemptRepository = {  getCount: (user: User): Promise<number>;  incrementCount: (user: User): Promise<number>;  resetCount: (user: User): Promise<number>;};// LoginAttemptCountRepository is a little contrived, but imagine we need data from a secondary sourceconst createCheckUserCanLoginController = (  userRepo: UserRepository,  loginAttemptRepo: LoginAttemptRepository,) => async (userId: number): Promise< LoginAttemptResult> => {  // get our data  const user = await userRepo.getUser(userId);  const attemptNumber = await loginAttemptRepo.getCount(user);  // perform business logic  const loginAttemptResult = checkUserCanLogin({ user, attemptNumber });  // flush the result, could wrap this up in its own function since branches are a code smell in partially applied controllers  switch (loginAttemptResult.result) {    case 'LOGIN_SUCCEEDED':      await loginAttemptRepo.resetCount(user);      break;    case 'LOGIN_REJECTED':      await loginAttemptRepo.incrementCount(user);      break;    default:      isNever(`user ${user.id} received invalid result ${loginAttemptResult.result}`);      break;  }  return loginAttemptResult;};

I intentionally left the implementation of the repositories out for now, because I want to demonstrate quickly why we're applying this pattern. Looking at those types I've written, is there any reason a repository HAS to be an actual database?

I could just as easily write these repositories as in-memory stores. In fact it might be easier. Then I could write out the full implementation of my feature before even considering how we're going to persist the data, or more importantly, I can write the tests first, then get this working with an in-memory repo before writing up my database layer.

Using this function itself is as easy as follows:

const checkUserCanLoginWithMemory = createCheckUserCanLoginController(  memoryBackedUserRepo,  memoryBackedLoginAttemptRepo);const checkUserCanLoginWithDatabase = createCheckUserCanLoginController(  databaseBackedUserRepo,  databaseBackedLoginAttemptRepo);// For completeness, here's what calling this looks likeconst loginResult = await checkUserCanLoginWithMemory(userId);

Let's write implementations for one of these repositories now, just so nothing is left to the imagination.

const users: User[] = [  // mock user not included for brevity];const memoryBackedUserRepository: UserRepository = {  getUser: async (userId: number): Promise => users.find(u => u.userId === userId);};

Yes, if you haven't realised already, I've left checking for non-results out in this code. We're going to pretend find always returns a result, and users always exist in a database.

Let's add a database repo

const databaseBackedUserRepository: UserRepository = {  getUser: async (userId: number): Promise => {    const result = await pool.query(`        SELECT * FROM users u        JOIN user_verifications uv ON u.id = uv.user_id        WHERE u.id = $1      `,      [userId]    );    return result.map(userDatabaseResultToUser).shift();  },};

So you're probably thinking "Hang on Anthony, this doesn't look very functional! This is just like the example you gave earlier of non-functional functions!"

You're right. The controller encapsulates all the impure behaviour in a single function. It allows us to keep other parts of the application pure while maintaining a functional style. That's one of the reasons branching is a code smell in partially applied controllers you don't want business logic sneaking into the impure parts of the code through an if statement here or there.

In our example, we can use the in-memory tests to assert that the flush operation logic is maintained. We could also write tests for the database-backed repository itself if we want to ensure that our counters worked properly against the database, or that our join returned the correct data.

But when we test the rules around who is allowed to login, the only thing under test is our implementation of those business rules, not the database, and not our mocks.

I hope the above demonstrates how we can use functional programming in our every day production systems!

Please leave a comment if you have any questions, suggestions, or improvements.

Programming a computer is trivial, programming your colleagues is not

Anthony Manning-Franklin — Tue, 07 Feb 2017 13:44:08 GMT

A lot of people believe that programming is about formulating a set of instructions for a computer. But once someone has been programming for a while, it becomes apparent that this not the real challenge. Anyone can get a computer to behave as instructed, and in fact they behave perfectly almost 100% of the time.

What complicates the process is our understanding of the intended behaviour and our mental model of the data structures involved. As humans we come pre-loaded with context and assumptions that sit just below the surface of consciousness. A bug in a program is never a computer behaving poorly but a mental blindspot in our understanding of the intended behaviour versus the instructed behaviour.

In fact, we've made programming much more complicated for the sake of making it more understandable. Entire disciplines have been invented out of thin air to make up for the fact that humans are inherently terrible at thinking logically.

A great example of this is Object Orientated Programming (OOP), an entire paradigm that exists solely in our heads. OOP says "Lets think about our problem set in terms of the nameable objects in that problem, their properties, relationships, and methods/functions (actions, in layman speak)". So for example a car might be described like this:

class Car:    wheels = new Wheels(4)    motor = new Motor('large')    def accelerate(self):        self.wheels.releaseBrake()        self.motor.increaseCombustion()    def stop(self):        self.motor.ceaseCombustion()        self.wheels.applyBrake(5)my_car = new Car()my_car.accelerate()my_car.stop()

Here we've said "Hey, I think a car should be an object that consists of two other objects, wheels and motors. Wheels can apply brakes and reduce speed, while motors can increase combustion thus supplying acceleration. A client using the car object can access these sub-objects by calling accelerate() and stop() on the car".

And this works, because smart people have created languages that translate this very human idea of interrelated objects and functions into something the computer understands - saved data in memory and saved functions in memory. Nothing more. In reality, computers haven't become any more sophisticated since the invention of transistors, but our ability to translate our ideas into code has, through more sophisticated intermediary languages, along with our access to machines capable of executing more instructions in less time.

So back to programming people. A programmer's objective is never really to make the program run, but to make sure they and other programmers can understand the program. While simultaneously ensuring that they understand the requirements well enough to know if the program is achieving its goal.

From this perspective, the stereotype of the socially inept programmer doesn't make much sense. A programmer should be a master of communication, if you're not agonising over variable names or the overlap of vocabulary between the problem space and technical vocabulary, then you're focusing on the wrong problems.

Every Programmer is Fighting Themselves

Anthony Manning-Franklin — Fri, 03 Feb 2017 13:36:06 GMT

First published in 2017

Every programmer brings two selves to each coding problem or even each line of code, and only one of you can win. The sooner you realise you are your own enemy, the better.

This is a general truth in all aspects of life, but as programmers we sometimes forget that we are humans running on a buggy meatOS with competing programs. These two programs are called short_term_interests and long_term_interests. In every decision we make we have to contend with our long term interests versus our short term interests.

Lets see what resources these two programs have at their disposal. Long term interests has intellect, planning, imagination and knowledge at its disposal. On the other hand short term interests has feelings. Oh dear, this is like having sudo access over your meatOS.

But when does this really matter? Well as programmers we all make the most logical, informed decisions with the knowledge at hand, 100% of the time dont we? HA! Every time you have skipped a test, lazily named a variable, skipped refactoring that code or neglected to write a detailed commit message youve let short term interests win. In other words, all instances of bad code are symptoms of poor emotional control.

So what will you do about it? The way I see it you have two options - keep making horrific mistakes until long term interests gains access to the initFear() method. OR you can use intellect, planning, imagination and knowledge to hack your meatOS now.

OKAY SO WHAT SORT OF SUBTERFUGE CAN WE RUN ON FEELINGS?

Well, we can learn to imagine the long term consequences of our actions more vividly. If youre avoiding writing a test, spend some time imagining coming back to this function later when its broken, your boss is pissed, the company has lost money because of you, and the a*hole who wrote this function didnt write any tests so you dont know what else might break when you fix it, or even what fixed means. I bet you feel a lot more like it now.

Another common error is thinking that you will do the hard boring bits tomorrow. Which is like saying Im going to wake up tomorrow and instantly have my ideal body because today I ate a heap of junk food. Instead, imagine that every decision you make today, youre committing to making repeatedly for the next 7 days. Didnt write a commit message? Well now you have 7 days without commit messages. These are the true stakes, because we will do again tomorrow what we did today. Now that one indiscretion feels a lot more serious doesnt it?

Some people might say but Anthony, I have too many important, urgent tasks to spend time looking after long term interests! Guess why you have so many important urgent tasks? Because every time short term interests wins, the long term version of yourself has to pay for it. Eventually all youre doing is paying debt with more debt and its game over. This is where you need habits and processes in place that prevent you from taking shortcuts. Things like setting up Continuous Integration/Delivery so you cant deploy without all tests passing on master branch. Set it up so that a build automatically fails if test coverage falls below a certain threshold. Sometimes the best thing you can do is limit your influence over the process.

It's often not a matter of avoiding feelings in order to meet long term interests - that's too hard. Instead, it can be a matter of building emotional stakes in favour of making the right choice. This is often what people value about "experience", it isn't so much the knowledge itself but the predisposition to use it sensibly. By being conscious of the process you can build emotions around best practice faster and with fewer mishaps. That way, when it's 4pm on a Friday, your coffee has ran out and Joe ate the last of your favourite snacks, your meatOS is less likely to make a bad choice.

Update 2021: I first wrote this post in 2017. While I still think a lot of it is true, the factor I missed is habits and practice. Kaizen pays big dividends here. If your patterns are muscle memory, you can spend your brain cycles on big picture problems, rather than trying to recall the pattern for read model populators or the docker syntax for multistage builds. Unfortunately that means typing the same thing over and over at somewhat regular intervals.