Ask Us Anything

Questions have been edited slightly for clarity.

Hunt Writing

When did meta/feeder puzzle writing begin for this hunt?

On April 4, we did a first testsolve for both Forward and Hibernating and Flying South. So let’s say that we started writing metas in late March.

Feeder writing began in earnest in mid-May. (There were some puzzle ideas that people were passionate about and were working on regardless of which answers were assigned—but meta answers were finalized and assigned in mid-May.)

How long does it take you all to put this together? Do you do it as a full-time job? Are you paid? Do donations cover your cost?

We started planning in February, holding weekly meetings. The total time invested is probably on the order of low-thousands of hours. We all volunteer our time freely and support ourselves through other means. Money that you donate to us is used for hunt-related expenses, such as server costs or individual puzzle costs.

Each puzzle in this hunt feels like a mini hunt with so much information. How long does it take to testsolve and check to make sure there are no mistakes?

Every puzzle gets two clean testsolves (meaning that the solvers are not spoiled on any of the mechanics, and they don’t get any hints from observing authors/editors), and then it is separately factchecked (meaning that another person combs through the data in the puzzle or the logical reasoning and verifies that it is accurate). If a testsolve is stalling, then an observer steps in and does one of two things. Either they will ask the testers to come back later after changes are made to the puzzle, or they will provide a hint with the hope that the group can continue to test the remaining part of the puzzle.

How long the factchecking process takes depends a lot on the puzzle. A “fast” puzzle requires only verifying crossword clues or unambiguous steps. Other puzzles take significantly longer. For example, factchecking Hibernating and Flying South required setting up an emulator and playing around in the game. The factchecker enjoyed this, but it took awhile.

How many resets did your testsolvers need for the puzzles in the World’s End Concert round?

378

How much suffering did you anticipate, versus how much seemed to happen? In particular suffering about (a) website uptime/lag, and (b) that teams needed to carefully manage resets and page turns/wormhole jumps.

We didn’t anticipate that the website would have serious problems. On the other hand, we were thinking about how to make the time loop fun to interact with from the beginning. One of our main points of discussion when deciding on a theme was whether the extra theme components would be “worth it” and would contribute positively to the core puzzle hunt experience.

Which round was the hardest to design/write/code?

World’s End Concert. For one, the metapuzzle was tested and tweaked many times in order to have the right difficulty. For two, there were a lot of tech requirements for the round and the puzzles in the round.

Is innovating hunt formats a primary goal for your writing team, or does it just kind of happen every year?

One of our main goals is to innovate, and the way we did that this year was by the unusual global structure. In past years, innovation has appeared in different ways (for example, in 2020, we decided to focus on unique round concepts).

Most importantly, I really want to see more of this bad puzzle ideas Discord channel (as referenced in the emoji crossword solution).

The hunt writers’ Discord is a sacred place to which entry is not easily gained. The 2020 puzzle Crosswerd also came from this channel (surely there are others too).

Were there older versions of any of the puzzles made using game engines that didn’t make it into the final version of the hunt?

No.

Any stories about puzzles changing, e.g. to fit into the Exam or World’s End Concert round structures?

Perhaps surprisingly, no. When we chose this theme, we knew that the Exam was going to be the way that it was, and we wrote puzzles specifically for its mechanic. Similarly, we decided that we wanted to do the World’s End round pretty early on, and puzzles were built with that knowledge. Of course, not all the puzzles ended up using the mechanic very heavily, but we think that bit of variety was probably for the best.

Writing a puzzlehunt is a different beast than an individual puzzle. Do you have any good resources for aspiring hunt writers?

The best resource is a group of people who will encourage, motivate, and inspire you. Everyone on this team has taught me so much, and I’m happy to have become their friend. If you have dreams, reach out to your teammates and the broader puzzle community! Throw ideas around! Write puzzles together! Testsolve each other’s puzzles!

A potentially less sappy answer from a different member of the writing team:

Hunt Design Decisions

Galactic Puzzle Hunt is traditionally a difficult hunt. How consciously do you make an effort to generate high difficulty in writing puzzles?

We tend to write difficult puzzles, but we don’t particularly try to. We have a general sense of “what makes a good puzzle”. Often, these puzzles end up being hard, but sometimes they’re fun and relatively straightforward (notably, in the case of horse_ebooks).

How did the leaderboard actually work?

Before you complete The Exam by solving The Square, you can only see teams’ progress in The Exam. Afterwards, you can see how many puzzles every team has solved overall. Your Exam solve count goes back to 0 each loop, but your 2021 solves are preserved.

Was there an explicit goal to create more interactive puzzles this year? It seemed like a large proportion (a majority?) of the later puzzles were interactive.

No, this happened naturally, probably due in part to the World’s End Concert round being well suited for interactive puzzles.

What happens if we submit the second answer for an Exam puzzle first? Did any team do that and get confused? How did you design it in such that the first answers are always identified first, and how did you dare release it this way and not worry about causing a lot of confusion?

If you submit an alternate answer first, then that answer is marked with a “?” instead of a check or an X, and the response “SUBMISSION is incorrect” appears glitchy. While we suspect that teams were rightfully confused about what is going on here, we hoped to lead them as naturally as possible to discovering the second answers. Because of the extra level of misdirection going on in The Exam, we explicitly tried to write easier puzzles for The Exam than those in intro rounds for past Galactic Puzzle Hunts.

As for how we dared to do it this way, to be honest we weren’t entirely sure if it would work out.

Did you suspect that The Line’s answer would be very guessable before hunt?

Yes.

What other time loop mechanics did you consider using (unless you want to save those ideas for future hunts)?

Some rough ideas are in the theme proposal doc. We had six meta proposals, of which four were chosen for the hunt.

How did the concept for World’s End Concert develop? The individual round theme seemed pretty different from the rest of the hunt.

The progression of ideas was: asteroid impact → the asteroid is a “rock concert” → the song “4 Minutes” has a nice time element to it → metapuzzle attempts using the song “4 Minutes”, and the idea to solve each puzzle in four minutes. Another branch that was explored was using the song “Time Warp” from the Rocky Horror Picture Show.

Where did the idea to have an opt-out of the timed unlocks come from?

Several teams emailed us requesting the feature! This feature is something that we will likely incorporate into future hunts.

Web

How did you do all the website code? Detailed write-ups of how you put the website together are always very interesting.

The starting point was our publicly-available codebase we’ve used for the last few years, gph-site, which uses Django (a Python library for web apps). On top of gph-site, we built in a lot of customization for this hunt. First, we needed to update the backend to handle the time loop. This required adding to the Django database models, and carefully managing and routing content between the 2021 and 2022 versions of the site. Second, we implemented a lot of frontend features for polish. This included the glitch text and effects on the site, the “Reset” screen when the time loop resets, and the cool animation on the 2021 Home page. These things were made using a mix of CSS and Javascript for styling and animation.

What happened with the website?

We still don’t entirely know! Our best guesses are in the wrap-up.

How did you load test the hunt and the interactive puzzles?

We didn’t specifically do load testing this year, although we got some insights about likely bottlenecks during testsolving.

Hints and During-Hunt

Which puzzles played much easier or harder than you anticipated?

The Earth Guide turned out to be significantly easier than we expected. Shared Samples and Broken Jukebox ended up being harder than we expected.

How did teams get help if they couldn’t finish enough of Piano Typer?

If they asked for a hint, we gave them the reward for beating a level.

How are the hint requests managed? Is it one big queue or is it split on smaller queues depending on which puzzle it is related to?

The hint requests go into one large queue. Each hint request that comes in has information about which puzzle it is for, which team asked it, and whether it’s a follow-up or a new hint.

WHO ANSWERS HINTS? How can you answer hints in 5m at 4am?? I was in 2 wildly different time zones during the hunt, and there was still no time that it took more than 10 minutes.

Answering a hint can be as easy as clicking a button to copy a past response, then clicking again to submit. Yannick Yao and Max Murin answered most of the hints this year. #freeyannick #freemax

What is your stance on trying to stream puzzlehunts?

We don’t allow it during the hunt because the puzzle answers should not be publicly available while the hunt is running. On the other hand, we like streaming as a medium and want to encourage it in a way that follows the rules. For example, recording a stream during the hunt and editing it down for later release is allowed, and so is streaming after solutions are available.

Specific Puzzles

What’s your favorite Touhou song?

Last Remote

How did you determine which teams to use for Galactic History Exam? I heard you had to write a lot of clues for different teams. What happens if TRALSE didn’t finish first? J is not an easy letter to clue. Did every team get a custom clue for their team name? That’s… amazing.

The algorithm tries to use your own team name, and then greedily takes teams from the ends. In retrospect perhaps it would have been better to take teams near your own team when possible. In order to make this work as often as possible, one goal was to write at least one clue for every team. For the ~600 teams that signed up before the start of the hunt, we were able to write clues for about 450 teams. Some teams were too hard to find a one-letter change, others had a strange format, and some others we just didn’t have the time to write a clue for. Due to the website crashing/slowdown going on, we decided not to add additional clues in the middle of the hunt unless absolutely necessary.

We also wrote a J clue for exAM TEsters and several other teams that have historically done well in previous GPH’s. In the event that a team was leading that didn’t have a J clue ready, we probably would write a clue for them and quickly push that change to the website! That said, there are a few failsafes—if a team didn’t have a clue written for them then it would default to trying to take the topmost ranked teams that works (and if that fails, there was a fixed puzzle where we give you the ordering directly, instead of using the leaderboard order.)

It’s crazy that Galactic History Exam exists. In order to make it, did you retroactively change the rules on each GPH site or did you plan for 5 years to make a history puzzle with QUINAPALUS as the solution?

We didn’t retcon anything on the GPH site. QUINAPALUS was intentionally placed, though we really only started planning for that around 2018 (for 2017 it was mostly a throwaway joke). Anderson also ensured that Rock Lobster was in every Spotify playlist in the hopes that it could be used as an index at some point in the future. It was a coincidence that the only letter in puzzle titles that doesn’t appear in either 2017 or 2018 was a J (which meant that we couldn’t index into titles (though we did think about trying to index into dʒʌmbəl), but did eventually motivate the current extraction). The authors’ notes go in a bit more detail about how we planned things for this puzzle.

What is one emoji you really wish you could have used in ✝🔤🧩?

The fridge emoji: 􏿾

How were the words for Reap What You Sow selected/generated?

See the author’s notes.

Did the eidolons being in those specific days for A Winding Walk matter?

No.

Favorite Mumbo Token transformation?

Crocodile

In Pocket Guide to Travel, what do you think was the effect of captioning all those images with alt text on the sighted solvers? Did it obviate the identification step too much?

We’re not sure. The alt text was initially MUCH more obvious (basically just giving away the answers), but we rewrote it to obscure the answers a bit more. We hope that we were able to make the puzzle accessible without creating too much of a difference in experience between a sighted solver who read the alt text and one who didn’t.

Since Piano Typer featured (well, kind of) classical music, I gotta ask—any recommendations for classical music? Favorite pieces of such?

The author has a Spotify playlist of similar music, and also normal classical playlists.

For other examples of “kind of classical” music, we recommend Lionel Yu’s Nightmare videos (start here), The Piano Guys, Lindsey Stirling, 2CELLOS, Vitamin String Quartet.

Puflantwe tacwlafwe puflantu tacadwl-ka zovwe dokfe odeleswe jadif na? (Are there any differences in how puflantu speakers speak puflantu? Put a different way: does puflantu have dialects?)

The galaxy is very big. In addition to dialects, other galactic languages incorporate (or are incorporated into) some words of puflantu. Spoken and written puflantu has evolved over time. I should also mention that there are puflantu keyboards, puflantu dictionaries, puflantu emoji, a puflantu text-to-binary encoding, puflantu ciphers, and many quite excellent works of puflantu literature and poetry.

We had an interesting time trying to connect some of the puzzles to Pokémon.

We like Pokémon too.

Out of the dishes served in The Dining Philosophers, which are your favorites / do you have any good recipes?

I made ratatouille recently but it did not look like the fancy Pixar version so I was disappointed.

For Combat Rollers, we were unable to figure out exact rules for computing the odds, but we were still able to solve the puzzle using 2 rules:

  1. Words with no shared letters have a 0% draw rate. Ergo, if two words have a draw rate of 0% , they share no letters. By process of elimination, a letter bank of an unknown word can be formed.
  2. Consistent input gives consistent output. Ergo, if known word A and known word B give a win-lose-ratio of X in practice mode, and known word A and unknown word C give a win-lose-ratio of X in campaign mode, then unknown word C is known word B. This can be used to confirm the words generated from the letter banks.
Did you expect solvers to approach the puzzle in this way?

At least one testsolving group used strategy 1 (although they also figured out the math so they can use nonzero tie probabilities as well).

Why do you guys like writing guess-and-check submission puzzles so much?

(Assuming that you are referring to Three and The Answer to This Puzzle Is...) The Answer to This Puzzle Is… was written to play with solvers’ expectations about how the answer checker works. Three was written to be a puzzle about history with a fun presentation (and a small thematic callback to The Answer to This Puzzle Is…, with the same leading question).

Random

What games do you recommend, apart from Baba is You, Opus Magnum, frogorf and Touhou?

Ghost Trick needs to be first on this list, but the following are in no particular order: Outer Wilds, Myst, La-Mulana, The Witness, Talos Principle, Stephen’s Sausage Roll, Braid, Superliminal, Snakebird, SpaceChem (and other Zachtronics games), the Ace Attorney games, Room to Grow, Recursed, Patrick’s Parabox, Return of the Obra Dinn, Jelly no Puzzle, Taiji, the Zero Escape series, the Portal series, the Professor Layton series.

What defines a puzzle, to you? (This question is in regard to Piano Typer.)

To me, a puzzle is something with a short solution path which is hard to find. A puzzle should have a definite completion (usually the answer). A good puzzle should be fun to try and rewarding to complete.

Is the timey wimey stuff over? Will the 2023 hunt actually be the 2023 hunt?

We can’t predict the future, but at least the Galactic Cinematic Universe has now filled in the gaps between 2017 and present. (Not to mention 2009.)

Is it unethical to repeatedly submit stupid answers? If you hate it, we won’t do it next time.

We love to see it. Though the number of guesses remaining might not…

Why isn’t your favorite Pokémon Unown R?

Because it’s not as good as Bulbasaur!

Have you “emoji kitchen’d” the ✈??

When are you going to reprint MacGregor House into a new hunt?

2048

Is ✈✈✈ Galactic Trendsetters ✈✈✈ going to sandbag if they end up taking the lead on Sunday in next year’s Mystery Hunt?

Who can say!

Will GPH 2023 consist of 40 puzzles that all have the answer of HALLEY’S COMET?

Hey, that’s a pretty good idea. Or maybe just the answer ANSWER.

From the writer’s perspective, what are everyone’s favourite puzzles?

Everyone writes the kind of puzzles that they like the best :)