Tasks framework versatility & performance

codingjoe · May 1, 2026, 11:26am

Hi there ,

I hopped on the bandwagon pretty quickly after the django.tasks release, as common interface would make my life as a 3rd-party maintainer simpler. I found out about it pretty late, but I was so grateful that someone (Jake) mustered the energy for such a massive proposal.

However, even though I am working on multiple commercial Django 6.0 projects, not a single one has adopted tasks in half a year. Why?

So, I did what any sane person does and tried building the tools I was missing. First was django-crontask. I was already maintaining its Dramatiq sister project for years—easy transformation, right?

No, that’s when I discovered Django has introduced dataclasses. I love dataclasses, but since they have special metaclasses, inheritance is tricky. For an task framework aimed to be extended by the community, this felt like an odd choice. The feeling was amplified by the fact the dataclasses are frozen. I have yet to uncover why the dataclasses are frozen. Especially since they are not immutable like a namedtuple, and freezing comes at a known performance disadvantage: dataclasses — Data Classes — Python 3.14.4 documentation

Shifting to the TaskResult it’s even stranger. It’s a frozen (emulated immutable) dataclass, but we treat it as mutable. When you call TaskResult.refresh it updates the object in place (using object.__setattr__) instead of returning a new immutable object. There was a review comment about this but to my knowledge it was sadly never addressed. It’s especially odd to me, since Python has a native dataclasses.replace function, which correctly returns new dataclass instances and would have greatly simplified the Django implementation.

Now, let’s chat about efficiency for a minute. For Django’s tasks framework to be a good base, it must be versitile and enable performance. It should be able to handle a few tasks a day, just as good as a couple million per second. Luckily Django 6.1 will have Picklable tasks to support multiprocessing, nice!

Still, I’d like to propose a few more changes to improve both versitility as well as performance:

Unfreeze Task: Dataclasses are a good choice here. Tasks are instanciated during module loading as quasi singletons. People can better use inheritance or even do in-place updates with decorators. Attribute read performance could be improved by slotting them.
Use typing.NamedTuple for TaskResult and TaskContext: You may hold millions of those objects in memory if you trigger map-reduce tasks (or other bulk operations). Dataclasses are objects (with a __dict__ or __slots__), whereas named named tuples are C structs. They use less memory both during runtime or in transport while piping them to different processes. They are also actually immutable, so no more in place updates.
Lazily reference TaskResult.task ( via import_string and a property): Currently every task result holds a reference to a task instance. This can lead to task instances (quasi singletons) being copied. This can create unwanted memory bandwidth and allocation overhead.
Make task results comparable: Tasks have a priority (wonderful!), but don’t implement a comparison method they would need for Python’s native PriorityQueue. I would suggest a default priority LIFO order.

This is a difficult topic and a difficult read. I am genuinely impressed by the work that has been done. But being so complex, I believe it will take multiple iterations to reach a robust framework that best serves the community.

Best!
Joe

codingjoe · May 4, 2026, 3:42pm

Amendment

Drop call / acall: Currently both use asgiref (even if they run outside of an ASGI lifecycle) and are thread-aware (1 thread per execution).

I think people are not aware of the performance consequences of thread-aware sync_to_async calls. I believe developers should make an active and aware decision how to execute a task. IMHO, those functions provide little to no use cases/benefit over just calling Task.func directly but with added control.

theorangeone · May 4, 2026, 8:24pm

A lot to unpack here, so here goes:

At present, unless django-tasks-db or django-tasks-rq work perfect for your use case, the onus on adopting django.tasks is more on library authors than it is project maintainers. The ecosystem needs some solid foundations first before people can start confidently building upon it. For some projects that’s django-tasks-db and django-tasks-rq, and for others it’s not - and that’s ok. I’m not surprised it’s taking time for projects to migrate to or build around django.tasks, but it’s definitely happening. If there are things the framework could do better (perhaps outside the below), please say - I’d love to have some conversations RealOrangeOne/django-tasks · Discussions · GitHub!

It’s not super complex, but I agree it’s not always obvious. django-tasks-db has a custom TaskResult which extends the base and it works just fine - all it needs is a dataclass decorator. I went with a dataclass since the problem at hand fit into their domain nicely, and beings with it a few shorthand niceties. I’m not at all tied to it, and if there’s good reason to convert to a conventional class - let’s! (deprecation details intentionally not discussed).

The main reason is to avoid the foot-gun of people mutating instances unnecessarily and expecting them to be committed. Since there’s currently no API to update results in place, it felt useful. However, it’s a problem people are aware of with ORM models, so again I’m not opposed to un-freezing. I’d suggest unfreezing Task and TaskResult together.

This isn’t quite true. Internally it’s mutated, namely so refresh functions as expected, and because in some cases it’s cleaner than passing everything into the constructor (sure, these might be solved by dropping dataclass). However externally, it’s intended to be considered read-only (subtly different to immutable). Pure immutability wasn’t really a design goal.

TaskResult.refresh was intended to mirror Model.refresh_from_db. You can still retrieve a clean updated instance with get_result, much like you can with the ORM. In practice, I doubt refresh will be called very often, since it assumes the TaskResult lives longer than the task takes to run. Since there’s already get_result, I don’t know that a method which returns a new instance on the TaskResult itself is useful - but I’m happy to be proven wrong.

Now for the changes:

Unfreeze Task: I’m interested in discussing this further to gauge input. Issues · django/new-features · GitHub is probably the place to go for this. Again though, I’d suggest considering TaskResult in the same conversation, since the same merits will hold true.
Use typing.NamedTuple for TaskResult and TaskContext: I don’t think a NamedTuple is right here. It leads to odd APIs which can be surprising to many. attrs has some great reading on this. With that said, I’d be interested in continuing discussions on replacing dataclasses with say a native class, especially if the cons outweigh the pros. new-features repo sounds like a good place for this discussion.
Lazily reference TaskResult.task ( via import_string and a property): This sounds like a great idea to me. Having to import and instantiate the Task for each result is likely fairly expensive, duplicates instances, and is just generally unnecessary work in many cases. It’s probably not quite ticketable yet, so new-features is probably the place to go to gain some wider input on impact.
Make task results comparable: This one is interesting to me - what value do you see in them being comparable? It’s more than just priority - in ORM speak it should be [F("priority").desc(), F("run_after").asc(), F("enqueued_at").asc()]. I’m not opposed to implementing that, but I’m not sure I see the value (at least wide enough to be implemented in core).
Drop call / acall: These are intentionally not part of the public API, and exist as a shorthand to trigger the functions easily, without each callsite needing to consider whether the task is async or not. It’s short-hand, not the intended operation. If a worker needs to grab .func directly, they absolutely should. I know sync_to_async can cause problems, but so long as concurrency within a single thread isn’t too high (which the GIL sort of blocks anyway), the overhead should be minimal - especially since if you know everything is async, you can avoid acall entirely.

Comments like these are exactly what this project needs. I don’t want to be the only one designing django.tasks - it needs other people’s thoughts and experiences to be a useful and “robust” framework. Thanks for taking the time to formulate and write it!

codingjoe · May 5, 2026, 10:47am

Hi @theorangeone,

Thanks for taking your time for the detailed response and providing a little history deep dive.

I hope you can tell that I am a big fan of the work you did and tasks in Django. I think many would agree that Django hasn’t seen such a big new feature since ASGI.

Community adoption

100%, it’s going to take a while, especially since there are very mature options like Celery.
That being said, I secretly dream of a day when we have a Redis queue in Django.

Dataclasses

Again, I do like dataclasses, but since the tasks implementation is swappable, the inheritance quirks are real. My main concern is that you can’t override fields, e.g., with properties, but it’s a small concern.

This isn’t quite true. Internally it’s mutated, namely so refresh functions as expected, and because in some cases it’s cleaner than passing everything into the constructor (sure, these might be solved by dropping dataclass ). However externally, it’s intended to be considered read-only (subtly different to immutable). Pure immutability wasn’t really a design goal.

I see… hm… I don’t think “internally mutable” but “externally immutable” communicates clearly to users. Especially since refresh will update in place, which even for a user makes the object mutable and requires state management for users.

Maybe it’s helpful if we flip a coin and decide on either paradigm.

about the changes…

Cool, I can open an issue there. I thought more about cleanup on Trac, but I am not in a rush.
typing.NamedTuple: attrs distinction didn’t age well. They are typed now. I also disagree with other points, since they focus on usage, not Python’s internal implementation. A tuple is not an object, which makes a big difference for performance. And honestly, this is my only angel. I want the have a smaller memory footprint and quicker serialization. Both are strong suits of a tuple. Yes, they are actually immutable, but as you mentioned before, this might be useful.
Same as 1., but no rush.
This is mainly about providing a base implementation for the magic methods, since the base task does already provide a priority. 3rd party packages will of course need to expand on this.
Oh, ignore my comment then. I thought they were public.

Performance receipts

I am working on a new queue-agnostic worker pool (*like billiard in Celery with a Gunicorn interface): GitHub - codingjoe/threadmill: A queue agnostic worker for Django's task framework. · GitHub
This involved numerous benchmarks. I will try to create some reproducible benchmarks that underline some of my performance concerns.

codingjoe · May 28, 2026, 3:04pm

@theorangeone, I spend some time to perform benchmarks. Each ran at N=1_000_000.

2. A named tuple with the tuple header actually inflates the memory footprint slightly. The slotted object is more memory efficient at scale because of the shared class.

Per-instance size (sys.getsizeof):
  Dataclass:   136 bytes
  NamedTuple:  152 bytes

Original TaskResult (frozen slotted dataclass):
  Time:             1.251s  (1.25 µs/call)
  Mem (peak):       252.20 MB
  Per-instance:     136 bytes (getsizeof)

NamedTuple TaskResult (factory):
  Time:             1.175s  (1.18 µs/call)
  Mem (peak):       275.09 MB 
  Per-instance:     152 bytes (getsizeof)

The performance difference is mainly because of the N+1 call stack overhead.

Now, what REALLY makes a difference is replacing the __post_intit__ for a __new__ factory:

Per-instance size (sys.getsizeof):
  __post_init__:  136 bytes
  __new__:        136 bytes

TaskResult with __post_init__:
  Time:      1.154s  (1.15 µs/call)
  Peak mem:  252.19 MB (for 1,000,000 instances)

TaskResult with __new__:
  Time:      1.363s  (1.36 µs/call)
  Peak mem:  137.76 MB (for 1,000,000 instances)

I assume that the call args and kwargs get allocated twice, while the original ones don’t get deallocated quickly enought.

I will propose that change in a cleanup ticket.

4. The lazy TaskResult.task reference via import_string has no impact on memory. Mainly because the tasks are singletons. So unless you have a million different task functions, there is nothing to be gained.

Original TaskResult (hard task ref):
  Time:      1.238s  (1.24 µs/call)
  Peak mem:  252.19 MB (for 1,000,000 instances)

Lazy TaskResult (string task ref):
  Time:      1.204s  (1.20 µs/call)
  Peak mem:  252.19 MB (for 1,000,000 instances)

codingjoe · May 28, 2026, 3:40pm

JannetGen · June 4, 2026, 8:52am

That matches my experience too. For smaller projects, keeping everything within the Django ecosystem can be a big advantage since there’s less infrastructure to manage. It’s nice to have a simpler option before jumping straight to a more complex task queue setup.

codingjoe · June 26, 2026, 3:06pm

@theorangeone I have a concern about TaskResults.errors: list[TaskError]. It forces the error to be a Python error. However, the error could also be an acknowledgment timeout or a completely unrelated issue with your non-python broker.
We might want to be more graceful with non-pythonic errors and expand the current error tuple implementation. Ref: django/django/tasks/base.py at 99672c672a1537aeb0d1fd5911ca6f04154cc091 · django/django · GitHub

What do you think?

BTW, exception_class doesn’t handle import errors.

jacobtylerwalls · June 26, 2026, 3:58pm

Can you flesh out your idea here? You don’t even have to use a custom error:

>>> from django.utils.module_loading import import_string
>>> import_string("builtins.ValueError")
<class 'ValueError'>

So my initial reaction is that letting the ImportError bubble would be just fine, but I’m presuming you’re about to tell me about a configuration where it’s not fine

codingjoe · June 26, 2026, 4:45pm

@jacobtylerwalls ok, I am working on that super secret thing

Jokes aside, strap in; this will get complicated: I am working on a highly durable message backend that uses tactics like acknowledgment as well as prefetching. If a prefetch lease expires, I can just requeue the task; all good. But if a running lease expires (aka. the task times out midway, e.g., power interrupt), I want to fail the task and just requeue it. A task might not be atomic or, in my case, send millions of emails. If the broker’s reaper finds it timing out, I intend to know.

I wiggled my way around it by implementing Python bindings for my message broker. It will inject an error that Django will be able to parse… but it feels like a dirty hack.

jacobtylerwalls · June 26, 2026, 5:23pm

I guess I don’t fully understand your architecture, I would have thought that any task backend had to be written in Python, and that would be the right layer to intercept and transform errors from whereever.

codingjoe · July 2, 2026, 10:05am

The backend may be written in Python, but the message broker doesn’t.
That being said, after spending a little more time on it: maybe remapping the errors is a good idea. Similarly to how Django maps DB driver errors.

Having Python exceptions is also a good precursor for Jake’s retry-backoff proposal. So, I’d probably keep things as they are.

Topic		Replies	Views
Steering Council vote on Background Tasks DEP 14 Django Internals	21	2291	October 11, 2024
django-tasks - bringing background workers in to Django core Show & Tell	5	4313	March 31, 2025
6.0 Documentation confusion for django tasks. Documentation	9	415	December 16, 2025
Maybe it's just a blind spot, when it comes to async django Async	22	5690	May 5, 2024
Is DEP009 ("async-capable Django") still relevant? Async	44	2991	May 8, 2025