OneToOneField caching behavior

Hi, the application I’m working on has some model relations that used a nullable ForeignKey to a related model, analogous to this example:

from django.db import transaction
from django.db.models import CASCADE, ForeignKey, Model


class Address(Model):
    company = ForeignKey('Company', null=True, blank=True, on_delete=CASCADE,
                         related_name='addresses')


class Company(Model):
    def create_address(self):
        with transaction.atomic():
            assert self.addresses.count() == 0
            Address.objects.create(company=self)

    def do_something_with_address(self):
        if self.addresses.count():
            do_something(self.addresses.get())


def test_create_address_for_company():
    company = mock_company()
    assert company.addresses.count() == 0
    company.create_address()
    address = company.addresses.get()

(aside: I realize that using count() and get() in do_something_with_address() results in an unnecessary extra query, and that it could be written with a try/except around get(), but I'm leaving that out of the example for clarity. Also, looking at this again, I realize the transaction.atomic() in the example doesn't achieve anything on its own: we'd need select_for_update() to guarantee uniqueness of Addresses per Company if we don't have a unique constraint on address.company_id. But all of that is beside the point.)
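
For what it's worth, a minimal sketch of the select_for_update() variant I mean, assuming we lock the Company row so concurrent create_address() calls serialize on it, would look something like this:

    def create_address(self):
        with transaction.atomic():
            # Lock this Company row; concurrent calls for the same company
            # block here until the transaction commits.
            locked = Company.objects.select_for_update().get(pk=self.pk)
            assert locked.addresses.count() == 0
            Address.objects.create(company=locked)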

We realized we could model our domain better if we made this relationship explicitly unique (every Company in this example would have zero or one Addresses), so we added unique=True to the foreign key. Everything continued to work nicely, except that Django now loudly warns at application startup that we really should be using OneToOneField instead of ForeignKey(unique=True).
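
Concretely, the intermediate version of the field looked something like this:

class Address(Model):
    # Works, but Django's fields.W342 system check warns at startup that a
    # ForeignKey with unique=True is usually better expressed as a OneToOneField.
    company = ForeignKey('Company', unique=True, null=True, blank=True,
                         on_delete=CASCADE, related_name='addresses')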

Today I tried migrating this relationship to use OneToOneField, using hasattr to suppress the DoesNotExist that accessing the address field would otherwise raise, like so:

class Address(Model):
    company = OneToOneField('Company', null=True, blank=True, on_delete=CASCADE)

class Company(Model):
    def create_address(self):
        with transaction.atomic():
            assert not hasattr(self, 'address')
            Address.objects.create(company=self)

    def do_something_with_address(self):
        if hasattr(self, 'address'):
            do_something(self.address)

def test_create_address_for_company():
    company = mock_company()
    assert not hasattr(company, 'address')
    company.create_address()
    address = company.address

I was surprised to find that the test failed, and looked at the test in a debugger to see what happened. The create_address call succeeded, and an Address associated with the test Company existed in the database, but it appears that the hasattr call at the beginning of the test caused the ReverseOneToOneDescriptor representing the company.address attribute to cache the lack of a related object. Specifically, this call to self.related.get_cached_value returned None, causing the Company instance to continue to behave as if it wasn’t associated with an Address.
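
To illustrate what I saw in the debugger: the cached value lives on the instance and is reachable through the descriptor's related accessor. This is internal API and may differ between Django versions, but on the version I'm using something like the following shows the stale cache:

company = mock_company()
hasattr(company, 'address')                # queries the DB, caches "no related object"
Address.objects.create(company=company)    # the row now exists in the database

rel = Company.address.related              # the OneToOneRel behind the descriptor
rel.is_cached(company)                     # True: the earlier miss was cached
rel.get_cached_value(company)              # None, even though the row exists

rel.delete_cached_value(company)           # drop the stale cache entry...
company.address                            # ...so this access queries the DB again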

I can see how there are circumstances where this is the desired behavior – you don’t want to have to hit the database every single time you access the attribute here – but I’m having trouble figuring out how to work around this caching behavior. In the original code with ForeignKey it was clear when the database was being queried: the example test above, as well as any code inside Company that needed to access the related object, would call get() on the addresses RelatedManager. But with OneToOneField, the actual relation/queryset/manager is abstracted away and inaccessible (as far as I can tell) from code that uses the Company model.

How can I access the reverse side of a OneToOneField in a way that guarantees I’m getting the actual value in the database as opposed to a value that may have been cached at some earlier point? Do I need to do something like company.refresh_from_db() in this example? If so, that also seems suboptimal – I don’t care about refreshing the values of the fields in the company database table, I just want to get the latest value from the related address table. Are there ways of working around this that I’m not seeing?
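
For concreteness, the most explicit workaround I can think of is to give up on the company.address attribute at these call sites and query the Address manager directly, along these lines (hitting the database on every call):

def do_something_with_address(self):
    # Always queries the address table, bypassing the descriptor's cache.
    address = Address.objects.filter(company=self).first()
    if address is not None:
        do_something(address)

But that loses most of what made OneToOneField attractive in the first place, so I'm hoping there's something better.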

Thanks for reading and please let me know if there’s something I’m missing here.

I’ve run across the same “caching the lack of a related instance” problem. And it gets worse. Mine was with a ForeignKey like this:

author = Author()
book = Book()
book.author = author   # author has no pk yet
author.save()
assert book.author_id is None   # the id was copied at assignment time, before author was saved
book.save()  # Error! Book instance has no author!

So I made a little helper to “refresh” the foreign key fields of an instance by re-assigning their related objects:

from django.core.exceptions import ObjectDoesNotExist
from django.db.models import ForeignKey


def fields_to_refresh(model):
    # All concrete foreign key fields declared on the model.
    return [
        f for f in model._meta.concrete_fields
        if isinstance(f, ForeignKey)
    ]


def refresh_obj(obj):
    for field in fields_to_refresh(type(obj)):
        try:
            other = getattr(obj, field.name)
        except ObjectDoesNotExist:
            continue
        if other is not None:
            # Re-assigning the related object makes the descriptor copy its
            # (now populated) pk back onto obj's <field>_id attribute.
            setattr(obj, field.name, other)

(This is a simplified version of my production code. The complete version adds some caching, can work with a collection of instances, etc.)
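
With the Author/Book snippet above, a sketch of how it gets used:

author = Author()
book = Book()
book.author = author
author.save()

refresh_obj(book)                  # re-assigns book.author, copying the now-set pk
assert book.author_id == author.pk
book.save()                        # saves with the correct author_id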

For your case, I’d recommend adding:

self.address = Address.objects.create(company=self)

And even saving self right there if you must.
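
In other words, assigning the new Address back through the descriptor keeps its cache in sync, so your create_address would become something like:

def create_address(self):
    with transaction.atomic():
        assert not hasattr(self, 'address')
        # Assigning through the reverse descriptor overwrites the cached
        # "no related object" entry left behind by the hasattr() call above.
        self.address = Address.objects.create(company=self)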

It would be nice if Django did it for you though.