Tanstack Start | In praise of –dry-run

In praise of –dry-run(henrikwarne.com)

142 points by ingve 11 hours ago | 75 comments

xavdid 35 minutes ago
I like this pattern a lot, but it's important that the code in the dry path is representative. I've been bitten a few too many times by dry code that just runs `print("would have updated ID: 123")`, but not actually running most of the code in the hot path. Then when I run it for real, some of the prep for the write operation has a bug / error, so my dry run didn't actually reveal much to me.
Put another way: your dry code should do everything up until the point that database writes / API calls / etc actually happen. Don't bail too early
- marhee 11 minutes ago |parent
  Doesn’t this conflate dry-running with integration testing? ASAIK the purpose of a dry-run is to understand what will happen, not to test what will happen. For the latter we have testing.
  - xavdid a minute ago |parent
    Yes, but it depends on the context.
    For little scripts, I'm not writing unit tests- running it is the test. But I want to be able to iterate without side effects, so it's important that the dry mode be as representative as possible for what'll happen when something is run for real.
mycall 6 hours ago
I like the opposite too, -commit or -execute as it is assumed running it with defaults is immutable as the dry run, simplifying validation complexity and making the go live explicit.
- Twirrim 6 hours ago |parent
  I've biased towards this heavily in the last 8 or so years now.
  I've yet to have anyone mistakenly modify anything when they need to pass --commit, when I've repeatedly had people repeatedly accidentally modify stuff because they forgot --dry-run.
  - IgorPartola 5 hours ago |parent
    I wouldn’t want most things to work this way:
```
    $ rm file.bin
    $ rm —-commit file.bin
    $ cat foo.txt > bar.txt
    $ cat foo.txt | tee —-write-for-real bar.txt
    $ cp balm.mp3 pow.mp3
    $ cp —-i-mean-it balm.mp3 pow.mp3
```
    There is a time and a place for it but it should not be the majority of use cases.
    - Darfk 5 hours ago |parent
      Totally agree it shouldn't be for basic tools; but if I'm ever developing a script that performs any kind of logic before reaching out to a DB or vendor API and modifies 100k user records, creating a flag to just verify the sanity of the logic is a necessity.
      - Joker_vD 5 hours ago |parent
        if [ -n "$DRY_RUN" ] ; then alias rm='echo rm' alias cp='echo cp' fi
        Of course, output redirects will still overwrite the files, since the shell does it and IIRC this behaviour can't be changed.
        digiown 5 hours ago |parent
        set -o noclobber
      - james_marks 5 hours ago |parent
        Yep. First thing I do for this kind thing is make a preview=true flag so I don’t accidentally run destructive actions.
    - digiown 5 hours ago |parent
      For most of these local data manipulation type of commands, I'd rather just have them behave dangerously, and rely on filesystems snapshots to rollback when needed. With modern filesystems like zfs or btrfs, you can take a full snapshot every minute and keep it for a while to negate the damage done by almost all of these scripts. They double as a backup solution too.
    - ronjakoi 2 hours ago |parent
      I used to have alias rm='rm -i' for a few years to be careful, but I took it out once I realised that I had just begun adding -f all the time
    - hdjrudni 5 hours ago |parent
      Even in those basic examples, it probably would be useful. `cp` to a blank file? No problem. `cp` over an existing file? Yeah, I want to be warned.
      `rm` a single file? Fine. `rm /`? Maybe block that one.
      - Izkata 34 minutes ago |parent
        That last one would error without doing anything anyway because it's not recursive.
  - __turbobrew__31 minutes ago |parent
    —dry-run should default to true
- torstenvl 2 hours ago |parent
  I have a parallel directory deduper that uses hard links and adopted this pattern exactly.
  By default it'll only tell you which files are identical between the two parallel directory structures.
  If you want it to actually replace the files with hard links, you have to use the --execute flag.
- spike021 3 hours ago |parent
  There was a tool I used some time ago that required typing in a word or phrase to acknowledge that you know it's doing the run for real.
  Pros and cons to each but I did like that because it was much more difficult to fat finger or absentmindedly use the wrong parameter.
- xyse53 6 hours ago |parent
  Yeah I'm more of a `--wet-run` `-w` fan myself. But it does depend on how serious/annoying the opposite is.
  - aqme28 5 hours ago |parent
    I've done that, but I hate the term "wet run."
    I use "live run" now, which I think gets the point across without being sort of uncomfortable.
    - IgorPartola 5 hours ago |parent
      --with-danger
      --make-it-so
      --do-the-thing
      --go-nuts
      --safety-off
      So many fun options.
      - Darfk 5 hours ago |parent
        I'm a fan of --safety-off. It gives off a 'aim away from face' or 'mishandle me and I'll blow a chunk out of your DB' vibe.
      - torstenvl 2 hours ago |parent
        It's in the UI not the command line, but I like Chromium's thisisunsafe
      - JsonCameron 4 hours ago |parent
        I've done a few --execute --i-know-what-im-doing for some more dangerous scripts
        altairprime 3 hours ago |parent
        May I recommend --I-take-responsibility-for-the-outcome-of-proceeding and require a capital I?
      - altairprime 3 hours ago |parent
        --commit is solid too
    - Quekid5 4 hours ago |parent
      Moist run is the way.
- lazide 3 hours ago |parent
  Just don’t randomly mix and match the approaches or you are in for a bad time.
arjie 7 hours ago
In order to make it work without polluting the code-base I find that I have to move the persistence into injectable strategy, which makes it good anyway. If you keep passing in `if dry_run:` everywhere you're screwed.
Also, if I'm being honest, it's much better to use `--wet-run` for the production run than to ask people to run `--dry-run` for the test run. Less likely to accidentally fire off the real stuff.
- wging 6 hours ago |parent
  One nice way to do things, if you can get away with it, is to model the actions your application takes explicitly, and pass them to a central thing that actually handles them. Then there can be one place in your code that actually needs to understand whether it's doing a dry run or not. Ideally this would be just returning them from your core logic, "functional core, imperative shell" style.
  - WCSTombs 6 hours ago |parent
    I totally agree with both this and the comment you replied to. The common thread is that you can architect the application in such a way that dry vs. wet running can be handled transparently, and in general these are just good designs.
    - IgorPartola 5 hours ago |parent
      That’s what I prefer as well. A generation step and an execution step where the executor can be just a logger or the real deal.
- ryandrake 6 hours ago |parent
  I don't want to have to type rm --wet-run tempfile.tmp every time, or mkdir -p --yes-really-do-it /usr/local/bin
  The program should default to actually doing whatever thing you're asking it to do.
  On the other hand it would be great if every tool had an --undo argument that would undo the last thing that program did.
  - homebrewer 6 hours ago |parent
    That undo program is called nilfs2, which unfortunately never became popular. I'll simply quote the kernel docs:
    > NILFS2 is a log-structured file system (LFS) supporting continuous snapshotting. In addition to versioning capability of the entire file system, users can even restore files mistakenly overwritten or destroyed just a few seconds ago.
    https://docs.kernel.org/filesystems/nilfs2.html
    https://wiki.archlinux.org/title/NILFS2
    https://en.wikipedia.org/wiki/NILFS
  - aappleby 6 hours ago |parent
    Sure, in those cases - but if a command has a chance of nuking prod, you want some extra step in there. Preferably something that can't be muscle-memoried through.
- sh-run 6 hours ago |parent
  I don't like the sound of `--wet-run`, but on more than one occasion I've written tools (and less frequently services) that default to `dry-run` and require `--no-dry-run` to actually make changes.
  For services, I prefer having them detect where they are running. Ie if it's running in a dev environment, it's going to use a dev db by default.
- segmondy 7 hours ago |parent
  this is where design patterns come in handy even tho folks roll their eyes at it.
  - nstart 5 hours ago |parent
    Design patterns are one of those things where you have to go through the full cycle to really use it effectively. It goes through the stages:
    no patterns. -> Everything must follow the gang of four's patterns!!!! -> omg I can't read code anymore I'm just looking at factories. No more patterns!!! -> Patterns are useful as a response to very specific contexts.
    I remember being religious about strategy patterns on an app I developed once where I kept the db layer separated from the code so that I could do data management as a strategy. Theoretically this would mean that if I ever switched DBs it would be effortless to create a new strategy and swap it out using a config. I could even do tests using in memory structures instead of DBs which made TDD ultra fast.
    DB switchover never happened and the effort I put into maintaining the pattern was more than the effort it would have taken me to swap a db out later :,) .
    - tbossanova 2 hours ago |parent
      What about the productivity gains from in memory db for tests though? Hard to measure I guess
  - cake-rusk 7 hours ago |parent
    Design patterns exist to paper over language deficiencies. Use a language which is not deficient.
    - WCSTombs 6 hours ago |parent
      There's some truth to this, since some design patterns can simply be implemented "for good" in a sufficiently powerful language, but I don't find it's true in general. Unfortunately, it has become something of a thought-terminating cliché. Some common design patterns are so flexible that if you really implemented them in full generality as, say, some library function, its interface would be so complex that it likely wouldn't be a net win.
    - awesome_dude 5 hours ago |parent
      Just my two cents - but a general purpose language is going to need to be coupled with design patterns in order to be useful for different tasks.
      I'm using MVC design patterns for some codebases, I'm using DDD plus Event sourcing and Event Driven for others.
      I suspect that you are thinking of a small subset of design patterns (eg. Gang of Four derived patterns like Visitor, Strategy, or Iterator )
    - antinomicus 6 hours ago |parent
      Like what?
      - frogulis 18 minutes ago |parent
        First class functions and iterators are probably examples of what they mean, in terms of language features that obsolete (GoF) design patterns
ElevenLathe 8 hours ago
I usually do the opposite and add a --really flag to my CLI utilities, so that they are read-only by default and extra effort is needed to screw things up.
- eichin 7 hours ago |parent
  I've committed "--i-meant-that" (for a destroy-the-remote-machine command that normally (without the arg) gives you a message and 10s to hit ^C if you're not sure, for some particularly impatient coworkers. Never ended up being used inappropriately, which is luck (but we never quantified how much luck :-)
  - tkclough 4 hours ago |parent
    I like the timer idea. I do something kinda similar by prompting the user to enter some short random code to continue.
    I guess the goal for both is to give the user a chance to get out of autopilot, and avoid up-arrowing and re-executing.
- weikju 8 hours ago |parent
  Came here to say the same
skissane 7 hours ago
In one (internal) CLI I maintain, I actually put the `if not dry_run:` inside the code which calls the REST API, because I have a setting to log HTTP calls as CURL commands, and that way in dry-run mode I can get the HTTP calls it would have made without it actually making them.
And this works well if your CLI command is simply performing a single operation, e.g. call this REST API
But the moment it starts to do anything more complex: e.g. call API1, and then send the results of API1 to API2 – it becomes a lot more difficult
Of course, you can simulate what API1 is likely to have returned; but suddenly you have something a lot more complex and error-prone than just `if not dry_run:`
- scruple 4 hours ago |parent
  Having 1 place (or just generally limiting them) that does the things keeps the dry_run check from polluting the entire codebase. I maintain a lot of CLI tooling that's run by headless VMs in automation pipelines and we do this with basically every single tool.
sixtram an hour ago
I use a similar strategy for API design. Every API call is wrapped in a large database transaction, and I either roll back or commit the transaction based on dry-run or wet-run flags. This works well as long as you don’t need to touch the file system. I even wrap emails this way—emails are first written to a database queue, and an external process picks them up every few seconds.
- sixtram an hour ago |parent
  To continue, this design has additional benefits:
  The code is not littered with dry-run flag checks; the internal code doesn’t even know that a dry run is possible. Everything is rolled back at the end if needed.
  All database referential integrity checks run correctly.
  Some drawbacks: any audit logging should run in a separate transaction if you want to log dry runs.
BrouteMinou 4 hours ago
One of the kick-ass feature of PowerShell is you only need to add `[CmdletBinding(SupportsShouldProcess)] ` to have the `-whatIf` dry-run for your functions.
Quite handy.
CGamesPlay 6 hours ago
For me the ideal case is three-state. When run interactively with no flags, print a dry run result and prompt the user to confirm the action; and choose a default for non-interactive invocations. In both cases, accept either a --dry-run or a --yes flag that indicates the choice to be made.
This should always be included in any application that has a clear plan-then-execute flow, and it's definitely nice to have in other cases as well.
cjonas 6 hours ago
We have an internal framework for building migrations and the "dry run" it's a core part of the dev cycle. Allows you to test your replication plan and transformations without touching the target. Not to mention, a load that could take >24 hours completes in minutes
zzo38computer 8 hours ago
I think dry run mode is sometimes useful for many programs (and, I sometimes do use them). In some cases, you can use standard I/O so that it is not needed because you can control what is done with the output. Sometimes you might miss something especially if the code is messy, although security systems might help a bit. However, you can sometimes make the code less messy if the I/O is handled in a different way that makes this possible (e.g. by making the functions that make changes (the I/O parts of your program) to handle them in a way that the number of times you need to check for dry run is reduced if only a few functions need to); my ideas of a system with capability-based security would allow this (as well as many other benefits; a capability-based system has a lot of benefits beyond only the security system). Even with the existing security it can be done (e.g. with file permissions), although not as well as capability-based security.
alexhans 3 hours ago
Agreed. For me a good help, a dry run and a readme with good examples has been the norm for work tools for a while.
It's even more relevant now that you can get the LLMs/CLI agents to use your deterministic CLI tools.
mystifyingpoi 3 hours ago
I like doing the same in CI jobs, like in Jenkins I'll add a DRY_RUN parameter, that makes the whole job readonly. A script that does the deployment would then only write what would be done.
bikelang 8 hours ago
I love `—-dry-run` flags for CLI tooling I build. If you plan your applications around this kind of functionality upfront - then I find it doesn’t have to pollute your code too much. In a language like Go or Rust - I’ll use a option/builder design pattern and whatever I’m ultimately writing to (remote file system, database, pubsub, etc) will instead write to a logger. I find this incredibly helpful in local dev - but it’s also useful in production. Even with high test coverage - it can be a bit spooky to turn on a new, consequential feature. Especially one that mutates data. I like to use dry run and enable this in our production envs just to ensure that things meet the functional and performance qualities we expect before actually enabling. This has definitely saved our bacon before (so many edge cases with prod data and request traffic).
tegiddrone 5 hours ago
I’m interested to know the etymology and history of the term. Somehow I imagine an inked printing press as the “wet run.”
- hydrox24 4 hours ago |parent
  It seems to have originated in the US with Fire Departments:
  > These reports show that a dry run in the jargon of the fire service at this period [1880s–1890s] was one that didn’t involve the use of water, as opposed to a wet run that did.
  https://www.worldwidewords.org/qa/qa-dry1.htm
- jofzar 4 hours ago |parent
  Interestingly the one place I have seen "dry run" to actually mean "dry run" is using a air compressor to check to see if a water loop (in a computer) doesn't leak by seeing if there no drop in pressure.
gooseyman 5 hours ago
https://news.ycombinator.com/item?id=27263136
Related
taude 6 hours ago
Funny enough, when creating CLIs with Claude Code (and Github Copilot), they've both added `--dry-run` to my CLIs without me even prompting it.
I prefer the inverse, better, though. Default off, and then add `--commit` or `--just-do-it` to make it actually run.
calebhwin 3 hours ago
And it's more important than ever in the age of coding agents.
aappleby 6 hours ago
What if the tool required an "un-safeword" to do destructive things?
"Do you really want to 'rm -rf /'? Type 'fiberglass' to proceed."
- jabroni_salad 4 hours ago |parent
  There is a package called molly-guard that makes you type the computer's hostname when you are trying to do a shutdown or restart. I love it.
- nthdeui 5 hours ago |parent
  Like tarsnap's --nuke command:
```
  --nuke  Delete all of the archives stored.  To protect against accidental
          data loss, tarsnap will ask you to type the text "No Tomorrow"
          when using the --nuke command.
```
throwaway314155 6 hours ago
Sort of a strange article. You don't see that many people _not_ praising --dry-run (speaking of which, the author should really learn to use long options with a double dash).
- CGamesPlay 6 hours ago |parent
  I'm not aware of any CLI arguments that accept emdash for long arguments–but I'm here for it. "A CLI framework for the LLM era"
- analog31 4 hours ago |parent
  I only saw the emdash in the thread link, but I do know that an iPad "wants" to turn a double dash into an emdash automatically. I have no idea how to disable that default.
TZubiri 5 hours ago
I use --dry-run when I'm coding and I control the code.
Otherwise it's not very wise to trust the application on what should be a deputy responsibility.
Nowadays I'd probably use OverlayFS (or just Docker) to see what the changes would be, without ever risking the original FS.
- throwaway290 4 hours ago |parent
  How do you easily diff what changed between Docker and host?
calvinmorrison 6 hours ago
--dry-run
--really
--really-really
--yolo
- homebrewer 6 hours ago |parent
  You'll like fontconfig then, which has both --force and --really-force
  https://man.archlinux.org/man/fc-cache.1.en
awesome_dude 7 hours ago
pffft, if you aren't dropping production databases first thing in the morning by accident, how are you going to wake yourself up :-)