Hello everyone
I’ve been working as a Node.js developer for a while; a few months ago I decided to learn Django.
As you probably know, Node.js works asynchronously, but Django does not (not by default, at least).
Here is my question: Django runs under WSGI, and a synchronous WSGI worker can only process a single request at a time. Say we have an endpoint where every request takes at least 10 seconds. If I have only one worker, what happens? Does this mean my Django server can only process 6 requests per minute?
I’d appreciate it if someone could clear this up for me.
Hey,
In general: yes. But in practice that’s not as problematic as it seems, because:
a) you don’t want a request taking that long anyway. So you would use caching to speed it up, or a separate task queue (usually Celery) to execute long-running tasks independently of the request cycle.
b) you can spin up as many worker threads as you want (using e.g. gunicorn). Those don’t run in parallel (thanks to Python’s GIL) but are interleaved. That works well if your task is I/O-bound and therefore spends most of its time waiting on e.g. the database.
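The interleaving in (b) is easy to see with plain Python threads. A toy sketch, where `time.sleep` stands in for a blocking database call:

```python
import threading
import time

def io_task():
    # simulates an I/O-bound request handler waiting on the database
    time.sleep(1)

start = time.perf_counter()
threads = [threading.Thread(target=io_task) for _ in range(5)]
for t in threads:
    t.start()
for t in threads:
    t.join()
elapsed = time.perf_counter() - start

# the five 1-second waits overlap: total time is roughly 1 second, not 5,
# because a thread blocked on I/O (or sleep) releases the GIL
print(f"{elapsed:.2f}s")
```

A CPU-bound loop in `io_task` would not overlap like this, which is why threads only help the I/O-bound case.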
And
c) You are encouraged to run multiple worker processes as well, up to 4 times the number of cores on your CPU. These separate processes do run in parallel, so they can handle cases where your tasks are somewhat CPU-bound.
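For concreteness, here is one common way to size a gunicorn deployment. The multipliers are heuristics (a frequently cited starting point is 2 × cores + 1 workers), not hard rules, and the project name is made up:

```python
import multiprocessing

cores = multiprocessing.cpu_count()
workers = 2 * cores + 1   # parallel processes for CPU-bound work
threads_per_worker = 4    # interleaved threads within each worker for I/O-bound work

# hypothetical invocation for a project whose WSGI module is myproject.wsgi
print(f"gunicorn --workers {workers} --threads {threads_per_worker} myproject.wsgi")
```

With this setup the server can have `workers * threads_per_worker` requests in flight at once, which is why a single slow endpoint doesn’t cap you at 6 requests per minute.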
Thank you for your response
I’m fully aware that I should move long-running tasks to a background job (e.g. using Celery and RabbitMQ), speed up database access with indexes, caching, etc., and run multiple worker processes.
My projects are I/O-bound most of the time, and I’m concerned about threads sitting idle and about other requests having to wait until the previous one finishes.
What if my database gets over 2 million records?
Was it a bad idea to use Django for my I/O-bound project?
Should I use Node.js instead?
By the way, my project is a doctor appointment booking system.
Thanks for the clarification
Do you have actual metrics demonstrating a problem? Or are these simply conjectures based upon other information?
Trivial, if your database, models, and queries are designed correctly. PostgreSQL is going to handle that without breaking a sweat.
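To illustrate the “designed correctly” point, here is a minimal sketch using SQLite from the standard library (the table and index names are invented for this example; PostgreSQL applies the same idea at much larger scale). With an index matching the query’s filter columns, the planner does an index search instead of scanning every row, so row count barely matters:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE appointment (id INTEGER PRIMARY KEY, doctor_id INTEGER, starts_at TEXT)"
)
# composite index matching the query below
conn.execute("CREATE INDEX idx_doctor_time ON appointment (doctor_id, starts_at)")

plan = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT * FROM appointment WHERE doctor_id = 7 AND starts_at >= '2024-01-01'"
).fetchall()

# the plan reports a SEARCH using idx_doctor_time rather than a full table SCAN
print(plan)
```

In Django you would declare the same index on the model’s `Meta.indexes` and let the ORM generate it in a migration.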
Since the work to retrieve that data depends upon the database and not the application framework, it doesn’t matter which framework you choose.
Not at all. If you’ve got delays caused by the size of the database, then you’re looking at solving the wrong issue. If you’ve got problems managing only 2 million rows, then your issue is the database and not the framework.
That’s your decision - and given that at the scales we’re talking about, the choices are effectively identical from a technical perspective - it’s a decision that should probably be based on non-technical factors. (Knowledge of the frameworks and associated libraries, comfort with the ecosystem, etc)
Thank you so much for the clarification
So essentially it’s not a big deal most of the time, as long as you leverage multiple workers.
It has never created a problem in the 10 years that I have been deploying Django projects.
Thanks again for your response