Connection lock hanging after upgrading psycopg

Alexerson · May 4, 2024, 10:25pm

I’m a bit puzzled by this one.
I recently tried upgrading psycopg for 2.9.9 to 3.1.18. When running my test suite, some tests still passes, but some of them are just hanging, blocked, never ending.
I added pytest-timeout to try to understand what was up, and I can see the runner is blocked in:

        Execute a query or command to the database.
        """
        try:
>           with self._conn.lock:
E           Failed: Timeout >60.0s

../../../.pyenv/versions/3.12.2/envs/braindate-api-3.12/lib/python3.12/site-packages/psycopg/cursor.py:727: Failed

So I’m assuming the connection can’t get a lock on the database now? Some queries work though, so it seems that it’s only after X queries that I have the issue.
I’m still trying to find a good minimal reproduction, but if anyone has clues as to what I can do to debug this, it would be appreciated.

What changed between v2 and v3 that can trigger this? This could help me debug it too.

DATABASES = {'default': {'NAME': 'test_project', 'USER': 'project', 'PASSWORD': '###', 'HOST': 'localhost', 'PORT': 5432, 'ENGINE': 'django.db.backends.postgresql', 'ATOMIC_REQUESTS': False, 'AUTOCOMMIT': True, 'CONN_HEALTH_CHECKS': False, 'OPTIONS': {}, 'TIME_ZONE': None, 'TEST': {'CHARSET': None, 'COLLATION': None, 'MIGRATE': True, 'MIRROR': None, 'NAME': None}}}```

KenWhitesell · May 4, 2024, 10:45pm

Are you running everything sync? Or are there some parts of this that are async? Are there any configuration or settings that you’ve changed? Are there any other changes you’ve made other than just changing psycopg?

Alexerson · May 4, 2024, 11:02pm

I do have some async views and tests in the app, and this was my first thought as well. But the failure happens even if no async test/view is executed.

Alexerson · May 5, 2024, 12:09am

Ok, so I looked a bit more and actually, the issue seems to be when I use a JSONField with a custom encoder.
In the encoder, I do this (among other things):

    def default(self, o) -> Any:
        if isinstance(o, models.Model):
            return {
                'content_type_id': ContentType.objects.get_for_model(o).pk,
                'id': o.pk,
                '__type__': 'Model',
            }
        return super().default(o)

And it seems that the call to ContentType is done while the lock is used by the save query already.

At least this gives me an idea of what to fix (not yet why exactly it wasn’t happening before, or how to do it).

Topic		Replies	Views
Intermittent PostgreSQL connection errors Mystery Errors	2	1082	June 21, 2024
Error/problem with psycopg3 and server-side cursors Using the ORM	2	1236	August 11, 2023
django.db.utils.InterfaceError: connection already closed when updating from django 3.0 to 3.1 Deployment	22	8611	May 21, 2024
Psycopg performance issue with Django 4.2 Using the ORM	4	1290	April 12, 2024
Django 4.2 vs 3.2 psycopg2 poor cursor performance ORM	4	546	April 15, 2024

Connection lock hanging after upgrading psycopg

Related topics