LLMs are here to stay, and if we believe Kurzweil's law of accelerating returns, proposed in his book The Age of Spiritual Machines, then theoretically they will improve at an exponential rate. Their utility, however, will depend on how proficient one gets at using them. LLMs are, in my opinion, less a single tool than a superset of tools: much like a Swiss Army knife, but with seemingly infinite sub-tools.
When I started writing code for Paylias, vibe coding wasn't as mainstream and the models weren't as sophisticated as they are today. I think GPT-3.5-turbo had just launched and people had started to scratch the surface of using LLMs for code generation, but agentic coding didn't exist (or if it did, I was too cheap to buy the subscription). So this meant that when Zed (my editor of choice) integrated agents into their editor, the codebase for Paylias had already grown to over 80,000 lines of Go code.
Given this, I wasn't going to let an AI model immediately start editing the codebase that I had, for the past 2 years, painstakingly worked on. But I was intrigued. I started using LLMs to improve the existing architecture of certain services, to find bugs in my code, and, on the rare occasion, to write some unit tests.
To be honest, it was like a drug at first. The speed of the output and the LLM's confidence in its own answers made me use it more and more, up until the point my free credits ran out. I think not purchasing a paid version of Cursor or Zed has prevented me from becoming an addict, so to speak. But I may be hindering my own velocity when it comes to shipping code, a dilemma I'm still coming to terms with.
Then OpenAI launched Codex.
Codex was originally locked behind their Pro plan, which was too expensive for me, so I shrugged it off and stuck with Zed for the time being. But when they released it to their Plus plan, boy was I ready to give it a shot. By this time, I had improved my prompting skills and figured out what works and what spits out garbage, so I figured Codex could really be useful for me. What excited me about Codex over Zed and Cursor is that if you configure an environment and give it some instructions, Codex will run in the background, attempt to follow and implement your instructions, and create a pull request for you. While it may seem risky to leave an LLM to its own whims and expect polished output, as opposed to the incremental changes of vibe coding, I've found that if you're specific about what you want, Codex will get you 90% of the way there. And while it works in the background, I can, in parallel, define the next task.
Let me give you an example. The other day I wanted to modify our Customer resource to accept both individuals and businesses (until then, only individuals were supported). After I had defined the properties I wanted to support for businesses, I knew the implementation would be a fairly monotonous task. I could do it myself, as I've done countless times for other resources, but I had to go out with my wife and wouldn't have time to sit and implement the changes. Instead, here's the prompt I came up with:
To be honest, it took me about 60 minutes to write this prompt. That's short relative to the time it would have taken me to implement the changes myself, but it's still a decent investment. On top of that, I wanted Codex to implement this in a single shot, so I focused on being precise and making sure I hadn't missed a step.
I'm what the Swedes call a tidsoptimist, and that day was no different. With the clock ticking and my wife giving me angry looks, I pasted the prompt into Codex and closed my laptop. When I returned home in the evening, I checked the output and, to my surprise, Codex had done exactly what I wanted it to. A quick review, a merge, and some slight manual modifications later, I was able to push up support for business customers in a day while still being able to spend time with my family!
Once your repository reaches a certain size, most code can be seen as a series of steps that repeat, to an extent. For example, if you're building a CRUD API, you'll almost always have to:
- Create a migration file to store data to and retrieve from a database
- Generate a model (optional depending on whether you're using an ORM)
- Create an internal representation of that model and one that can be exposed to the client
- Create repository functions to CRUD that model from the database
- Create functions to manage the business logic around this model—for instance, ensuring uniqueness, ownership, etc.
- Create representations of how you expect data to be sent to your APIs from the client to interact with your model
- Expose the endpoints through your routing layer
- Wrap the routes in any authentication middleware you have
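The checklist above maps naturally onto layers of Go code. Here's a minimal, self-contained sketch of a few of those layers (internal model, client-facing representations, repository, and service-level business rules), using an in-memory store and hypothetical names; a real version would back the repository with SQL, expose handlers through a router, and wrap them in auth middleware:

```go
package main

import (
	"errors"
	"fmt"
	"strings"
	"sync"
)

// Internal model: what a database row maps to.
type Customer struct {
	ID    int64
	Email string
	Type  string // "individual" or "business"
}

// External representation exposed to API clients.
type CustomerResponse struct {
	ID    int64  `json:"id"`
	Email string `json:"email"`
	Type  string `json:"type"`
}

// Shape of the payload a client sends to create a customer.
type CreateCustomerRequest struct {
	Email string `json:"email"`
	Type  string `json:"type"`
}

// Repository layer: CRUD against storage (in-memory for this sketch).
type CustomerRepo struct {
	mu     sync.Mutex
	nextID int64
	rows   map[int64]Customer
}

func NewCustomerRepo() *CustomerRepo {
	return &CustomerRepo{nextID: 1, rows: map[int64]Customer{}}
}

func (r *CustomerRepo) Create(c Customer) Customer {
	r.mu.Lock()
	defer r.mu.Unlock()
	c.ID = r.nextID
	r.nextID++
	r.rows[c.ID] = c
	return c
}

func (r *CustomerRepo) FindByEmail(email string) (Customer, bool) {
	r.mu.Lock()
	defer r.mu.Unlock()
	for _, c := range r.rows {
		if c.Email == email {
			return c, true
		}
	}
	return Customer{}, false
}

// Service layer: business rules such as uniqueness and allowed types.
var (
	ErrDuplicateEmail = errors.New("email already registered")
	ErrInvalidType    = errors.New("type must be individual or business")
)

type CustomerService struct{ repo *CustomerRepo }

func (s *CustomerService) Create(req CreateCustomerRequest) (CustomerResponse, error) {
	t := strings.ToLower(req.Type)
	if t != "individual" && t != "business" {
		return CustomerResponse{}, ErrInvalidType
	}
	if _, exists := s.repo.FindByEmail(req.Email); exists {
		return CustomerResponse{}, ErrDuplicateEmail
	}
	c := s.repo.Create(Customer{Email: req.Email, Type: t})
	return CustomerResponse{ID: c.ID, Email: c.Email, Type: c.Type}, nil
}

func main() {
	svc := &CustomerService{repo: NewCustomerRepo()}
	resp, err := svc.Create(CreateCustomerRequest{Email: "a@example.com", Type: "business"})
	fmt.Println(resp, err)
}
```

Each layer only talks to the one below it, which is exactly why the pattern is so repetitive: every new resource is the same scaffolding with different fields.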
Spell out any files you want modified or created, and tools like Codex can handle almost 90% of this work with a surprising degree of accuracy. Pair that with an AGENTS.md file that locks in your coding preferences, choice of libraries, and conventions, and you've got yourself a pretty good setup.
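For context, AGENTS.md is just a markdown file at the repository root that agents like Codex read before they start working. Here's a hypothetical, illustrative example; none of these rules are from my actual file:

```markdown
# AGENTS.md

## Conventions
- All services are written in Go; follow the existing layering: model → repository → service → handler.
- Database access goes through the repository layer only; never query from handlers.
- Wrap returned errors with context: fmt.Errorf("creating customer: %w", err).
- New unit tests should be table-driven.

## Libraries
- Use the router and ORM already in the repo; do not add new dependencies without asking.

## Workflow
- Any schema change must include a migration file.
- Run gofmt and go vet before opening a pull request.
```

The point is to state preferences once so every prompt doesn't have to repeat them.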
I've been using Codex quite frequently when these kinds of repetitive tasks come up, but I want to improve my prompting techniques. I've been looking at this guide from Foundry and the now infamous Lyra prompt to see what I can borrow and try with my setup.
Curious to know what's worked for you and what hasn't. I do still think that autocomplete suggestions in code editors have a long way to go...