Essay · March 30, 2026 · 10 min read

From laptop to dev server: running agent work across multiple machines

A useful agent workflow is not just about starting work from a laptop. It is about deciding where the work should run, preserving the same quality gates across machines, and keeping supervision intact when you are no longer in front of the original terminal.

The first time an AI coding agent feels magical, it usually happens on a laptop.

You type a task, the model explores the repo, drafts a plan, edits files, runs checks, and comes back with something that looks much more substantial than a normal shell alias or code generator ever could.

That first experience is important, but it also creates a bad default assumption: that agent work is basically a single-machine activity.

In practice, serious agent use spreads out quickly.

  • You start a task on a laptop because that is where you noticed the issue.
  • You want heavier implementation work to run on a workstation or dev server with a better toolchain or more stable network access.
  • You keep long-running tasks away from the machine you carry between meetings.
  • You may need to supervise a production-adjacent machine carefully without doing the initial thinking there.

The question stops being “Can the agent do the work?” and becomes “Where should this work run, and how do we keep control when the machine doing the work is no longer the one in front of us?”

That is the real multi-machine problem.

It is not mainly a sync problem. It is an operating problem.

The machine doing the work is often not the machine where the work starts

Developers already spread ordinary engineering work across multiple machines for good reasons:

  • laptops are convenient for starting things,
  • workstations are better for heavier local builds,
  • dev servers have the right network access or background services,
  • and production-adjacent boxes are sometimes the only safe place to inspect a live issue.

AI coding agents do not remove those distinctions. If anything, they make them more obvious.

An agent can spend meaningful time traversing a repo, running tests, rebuilding, retrying, and waiting on checks. That makes work placement more important, not less. The machine choice affects:

  • execution speed,
  • network reachability,
  • local credentials and environment parity,
  • tool availability,
  • how interruptible the session is,
  • and how much supervision friction you are taking on.
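The factors above can be made explicit rather than left to habit. Here is a minimal sketch of work placement as a scoring step: every machine profile, field name, and friction weight below is hypothetical, chosen only to illustrate the tradeoff, not drawn from any real tool.

```python
# Hypothetical sketch: choose an execution machine by matching it against
# the needs of the current stage, instead of defaulting to the laptop.
from dataclasses import dataclass

@dataclass
class Machine:
    name: str
    fast_builds: bool = False
    always_on: bool = False
    reaches_internal_network: bool = False
    supervision_friction: int = 0  # higher = harder to intervene on

def place_stage(machines, *, needs_fast_builds=False, long_running=False,
                needs_internal_network=False):
    """Return the machine that satisfies the hard requirements with the
    least supervision friction. Raises if no machine qualifies."""
    candidates = [
        m for m in machines
        if (not needs_fast_builds or m.fast_builds)
        and (not long_running or m.always_on)
        and (not needs_internal_network or m.reaches_internal_network)
    ]
    if not candidates:
        raise RuntimeError("no machine satisfies this stage's requirements")
    return min(candidates, key=lambda m: m.supervision_friction)

fleet = [
    Machine("laptop", supervision_friction=0),
    Machine("workstation", fast_builds=True, always_on=True,
            supervision_friction=1),
    Machine("dev-server", always_on=True, reaches_internal_network=True,
            supervision_friction=2),
]

# Planning can stay on the laptop; a long verification pass cannot.
print(place_stage(fleet).name)                               # laptop
print(place_stage(fleet, long_running=True).name)            # workstation
print(place_stage(fleet, needs_internal_network=True).name)  # dev-server
```

The design choice worth noticing: hard requirements filter, and supervision friction only breaks ties. That encodes the article's point that the laptop wins by default only when nothing about the stage rules it out.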

If you keep treating the laptop as the universal execution surface, you eventually get squeezed by one of two problems.

Either the laptop is the wrong place to run the work, or it is the wrong place to keep the only live control surface.

Multi-machine agent work breaks in quiet, expensive ways

The hardest part of multi-machine workflows is that they usually do not fail dramatically. They fail in small operational leaks.

You start a task on the laptop, but the laptop sleeps.

You SSH to a dev server to continue, but now the human context is split between two shells and one half-remembered terminal scrollback. (For a deeper comparison of the manual SSH and tmux approach versus a dedicated control plane, see MuxAgent vs. manual SSH and tmux.)

You launch a long verification pass on another machine, then leave your desk and realize the only practical way to intervene is to sit back down in front of that exact session.

Nothing is fundamentally impossible there. It is just awkward enough that teams make bad tradeoffs:

  • they keep work on the wrong machine because moving it is annoying,
  • they skip a checkpoint because returning to the right terminal is expensive,
  • or they leave a task under-supervised because the machine boundary turned a simple decision into a context-switch tax.

This is why “multi-machine support” should not be read as a convenience bullet. It is a throughput feature. If the operator cannot keep continuity when work moves between laptop, workstation, and server, the agent is only useful in the slices of the day when the human and the original machine happen to be colocated.

That is too fragile for real use.

Treat work placement as a deliberate engineering choice

The most useful shift is to stop asking where the task started and start asking where each stage of the work belongs.

Different machines are good at different parts of the job.

Laptop

The laptop is usually the best place for:

  • capturing the task when you first notice it,
  • reviewing context,
  • deciding the workflow config,
  • and making high-level approval calls.

It is the machine you have with you. That matters.

But it is not automatically the best place for long implementation passes, heavy local builds, or tasks that need a stable always-on environment.

Workstation

A workstation is a good fit when:

  • the repo is large,
  • builds are expensive,
  • multiple local services need to be running,
  • or you want an execution surface that is stronger than a laptop but still under your direct control.

The value here is not just speed. It is stability. You want the machine running the agent to stay available even if the laptop closes or moves.

Dev server

A dev server is often the right place when:

  • the task depends on remote services or network reachability the laptop does not have,
  • the environment needs to stay alive for a long time,
  • or the machine doing the work should be closer to the actual systems being inspected.

This is where agent work starts to look much more like distributed operations than a fancy local prompt.

Production-adjacent machine

This should be handled carefully, but sometimes it is the only credible place to inspect or validate something.

The point is not to run everything there. The point is to recognize that work placement is part of the task design. “Use an agent” is not a complete operational instruction. “Use an agent on the right machine for this stage of the work” is much closer.

Keep the workflow constant even when the machine changes

One reason multi-machine agent use gets messy is that people let the process change when the hardware changes.

They plan on a laptop, improvise on a server, and then try to remember which quality bar still applies.

That is exactly where graph-based workflows help.

MuxAgent separates workflow config from runtime and machine choice. The workflow config decides the graph: planning, review, approval, implementation, verification, and the allowed recovery paths between them. Runtime selection decides whether agent nodes execute with Codex or Claude Code. The machine choice decides where the work actually runs. MuxAgent’s multi-machine architecture makes this separation both possible and practical.
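One way to read that separation is as three independent fields on a task: the graph comes from the workflow config, the runtime is a separate choice, and the machine comes from placement. The sketch below is purely illustrative; the node names mirror this article, and the data layout is an assumption, not MuxAgent's actual schema.

```python
# Illustrative only: the three decisions as three independent fields.
# Node names follow the article; nothing here is MuxAgent's real format.

WORKFLOW_GRAPH = {
    "planning":       ["review"],
    "review":         ["approval", "planning"],    # recovery path: back to planning
    "approval":       ["implementation"],
    "implementation": ["verification"],
    "verification":   ["implementation", "done"],  # recovery path: retry on failure
    "done":           [],
}

task = {
    "graph": WORKFLOW_GRAPH,   # decided by the workflow config
    "runtime": "claude-code",  # decided independently (or "codex")
    "machine": "laptop",       # decided by work placement
}

# Moving the work touches exactly one field; the graph and runtime stay put.
task["machine"] = "dev-server"
assert task["graph"] is WORKFLOW_GRAPH
```

Because the graph is its own object, "move to the dev server" changes one field and leaves every checkpoint and recovery path intact, which is the continuity the next section argues for.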

Those are different decisions.

That separation is what keeps multi-machine work sane. You can keep the same workflow contract while changing the place where the work happens.

For example:

  • use default when a human should approve the approach before code changes land,
  • use autonomous when you trust the agent to keep moving without an approval stop,
  • use plan-only when you want a reviewed plan before touching code,
  • or use yolo when you deliberately want a fully autonomous multi-wave run.
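These configs differ mainly in which checkpoints pause for a human and how far the run proceeds. The table below is an interpretation of the descriptions above, assuming plausible semantics for each name; it is not a dump of MuxAgent's actual config format.

```python
# Interpretive sketch of the four configs described above.
# Field semantics are assumptions, not MuxAgent's real options.
CONFIGS = {
    #              plan reviewed | approval stop | proceeds to code | multi-wave
    "default":    dict(plan_review=True,  approval=True,  implement=True,  multi_wave=False),
    "autonomous": dict(plan_review=False, approval=False, implement=True,  multi_wave=False),
    "plan-only":  dict(plan_review=True,  approval=False, implement=False, multi_wave=False),
    "yolo":       dict(plan_review=False, approval=False, implement=True,  multi_wave=True),
}

def needs_human(config_name: str) -> bool:
    """Does this config stop and wait for a person at any point?"""
    c = CONFIGS[config_name]
    return c["plan_review"] or c["approval"]

print(needs_human("default"))  # True
print(needs_human("yolo"))     # False
```

Read this way, the choice of config is a statement about where human attention is required, which is exactly why it should track risk and ambiguity rather than which machine happens to be running the work.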

The important part is that moving from laptop to server does not require inventing a new process. The graph stays recognizable. The checkpoints stay recognizable. Recovery stays recognizable.

That reduces the biggest multi-machine risk: losing the thread of how the work is supposed to proceed.

What MuxAgent changes in the multi-machine story

MuxAgent is useful here because it was designed around the assumption that the human and the machine doing the work are often not in the same place.

The CLI runs on whatever machine is right for the job: laptop, workstation, dev server. You install it where the work should happen. That part is straightforward.

The harder part is what happens after the work starts running somewhere else. How do you stay in the loop when the agent is on a dev server and you are walking between meetings? How do you respond to an approval request when you are not sitting in front of the terminal that issued it?

That is the remote-control problem, and it is where the paired mobile app comes in. (For more on why remote control is a first-class product problem rather than a nice-to-have, see why remote control is the missing layer for AI coding agents.)

The mobile app connects to your machines and gives you a live view of running agent sessions. You can see what the agent is doing, review approval checkpoints, and intervene — all from your phone. The machine doing the work does not need to be the machine in front of you.

This is not about replacing the terminal. It is about keeping you reachable when the terminal is somewhere else.

A practical operating pattern that actually holds up

The strongest multi-machine setups are boring in the best way. They reduce improvisation.

Here is a pattern that works well.

1. Start the task where you notice it

Use the laptop to capture the job while the context is fresh.

Pick the workflow config based on risk and ambiguity, not machine habit. If the task still needs scoping, start with plan-only or a plan-first workflow. If the task is substantial and needs explicit sign-off, use default.

The first machine is often the best place to decide the shape of the work even if it is not the best place to execute every stage.

2. Move execution to the machine that matches the work

Once the plan is sound, run the heavier implementation or verification work on the workstation or dev server that actually fits the task.

Do not ask the laptop to be a dev server just because the session began there.

3. Keep the same workflow contract

Do not silently drop checkpoints because the work moved.

If the plan required approval on the laptop, it still requires approval after the agent is running on the other machine. If verification is part of done, it is still part of done. Multi-machine execution should not become an excuse for a weaker process.

4. Keep supervision reachable

This is the part teams underweight.

A long-running session is only as useful as your ability to inspect it and intervene when it reaches a meaningful branch point. If the only way to check on the agent is to sit back down in front of the original terminal, you have a fragile setup.

Pair the mobile app so you can monitor progress and respond to approval requests from your phone. That is what prevents a session on the dev server from turning into a black box while you are away from the desk. If you have not added a second machine yet, see how to pair MuxAgent on a second machine without friction for the step-by-step process.

5. Bring back proof, not just claims

When the task returns from the other machine, what matters is not “the agent said it is done.” What matters is that the outputs are reviewable:

  • plan artifacts,
  • implementation summary,
  • verification results,
  • and the relevant build or test evidence.

Machine changes should not weaken the evidence chain.
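A simple way to enforce that is to treat "done" as a claim that must be backed by artifacts. The check below mirrors the list above; the artifact names and layout are hypothetical, for illustration only.

```python
# Sketch: refuse to accept "done" unless the evidence listed above exists.
# Artifact names mirror the article's list; the layout is hypothetical.

REQUIRED_ARTIFACTS = {
    "plan",                    # plan artifacts
    "implementation_summary",  # what changed and why
    "verification_results",    # test / check outcomes
    "build_evidence",          # relevant build or test logs
}

def reviewable(artifacts: dict) -> bool:
    """True only if every required artifact is present and non-empty."""
    return all(artifacts.get(name) for name in REQUIRED_ARTIFACTS)

# An agent's claim without verification output does not pass the bar.
print(reviewable({"plan": "draft plan", "implementation_summary": "diff summary"}))  # False
```

The useful property is that the predicate does not care which machine produced the artifacts, so moving execution cannot silently lower the bar.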

Multi-machine does not mean multi-everything

There is a bad version of this pattern too.

It looks like:

  • one branch edited from three machines without clear ownership,
  • ad hoc approvals in chat,
  • a laptop transcript that no longer matches the server state,
  • and a vague sense that the agent is “working somewhere.”

That is not continuity. That is distributed confusion.

Good multi-machine operation is narrower than that.

Pick one machine as the execution surface for the current stage. Keep one workflow graph as the source of truth. Keep the checkpoints explicit. Keep the control surface reachable. Keep the outputs inspectable.

The goal is not to maximize how many machines are involved. The goal is to make machine boundaries stop being a reason for agent work to stall, drift, or become harder to supervise.

The point is continuity, not novelty

The reason multi-machine support matters is not that it sounds advanced. It is that real engineering work does not happen in one chair, on one computer, during one uninterrupted sitting.

You notice issues on a laptop. You run heavy work elsewhere. You leave your desk. You come back. A plan needs approval. Verification fails on the machine with the right environment, not the one nearest your hands. Priorities change while the agent is still active.

That is the normal shape of modern work.

Once you accept that, the right question is no longer “How autonomous is the model?” The right question is “Can the workflow survive the fact that the work and the operator move independently?”

That is what a serious multi-machine setup gives you:

  • the right machine for each stage,
  • the same workflow graph across environments,
  • reachable supervision,
  • and continuity when the work outlives the original terminal.

That is not an edge case. It is the operating model AI coding agents need if they are going to keep being useful after the demo wears off.