Idea to explore: Partial migration operations

lorinkoz · October 9, 2024, 7:43am

Inspired by this DjangoCon US 2024 talk and some personal struggles with the topic of zero-downtime migrations, I’d like to explore the idea of “partial migration operations”. I will use the use-case of deleting a model field as example, maybe there are more applications. In any case, this post is just a playground to explore the idea.

In order to delete a model field with zero downtime, one must do a bit more than what python manage.py makemigrations does when removing the field from the model code: make the field nullable, remove the field from the state and finally remove the field from the database.

So here’s the idea. Imagine one could delete the field from the model and then do: python manage.py makemigrations --partial. This would generate a new RemoveFieldOperationPart1 (please forgive the name) operation in a migration. This migration would do a batch of safe operations to be done in a deployment (e.g. nullify and remove from state). After a deploy we could do python manage.py makemigrations --partial again and now the field does no longer exist but the state knows there is a RemoveFieldOperationPart1 so it would generate a RemoveFieldOperationPart2Final, which would finally remove the column from the table.

The --partial argument would be necessary to not be generating the part 2 until explicitly asked for (aka after a deployment). I could also imagine that if there are pending partials from multiple places, the makemigrations command could be interactive in order to choose which subsequent partials to include.

What do you think?

lorinkoz · October 9, 2024, 7:47am

Oh well, maybe a duplicate of Let's talk zero-downtime migrations ? In any case, I know the topic has been abundantly discussed and different approaches for zero-downtime migrations explored+implemented. Not sure if this specific idea has been explored though. If this already rings a bell to you and leads to a deadend, please let me know.

charettes · October 9, 2024, 11:00am

Hello @lorinkoz, I do think this is duplicate discussion to the thread you’ve linked.

I believe the minimal changes that would be needed in core to achieve what you’re describing are demonstrated in this package. The primitive needed to make it work are

Add the notion of stage for migrations to allow the framework to create distinct operation for each one (e.g pre and post deployments).
Adjust the auto-detector (aka makemigrations) to take advantage of the notion of stage by producing migrations partitioning operations by it.
Adjust the migration executor (aka migrate) to be able to run in either stage (pre and post deployments).

I think it would be better to continue the discussion over the other thread though to avoid fragmenting the discussion.

lorinkoz · October 9, 2024, 11:44am

Alright, thank you! I will cross reference and follow up on the other one!

Topic		Replies	Views
Let's talk zero-downtime migrations ORM	11	1881	October 9, 2024
Backwards compatible migrations Using Django	4	3558	October 31, 2024
Why does the deletion of an unmanaged models create migration? ORM	4	1955	September 13, 2022
Ticket #24686 - Support for Moving a Model between two Django Apps - Implementation Approach Feedback Mentorship	8	739	March 19, 2023
How do i remove a field from a model inheriting from another model if migrations already set the `bases` attribute in CreateModel Using the ORM	4	1923	October 11, 2023

Idea to explore: Partial migration operations

Related topics