Filtering on model field after aggregating applies the filter first

ayyoubelamrani4 · June 26, 2023, 11:40am

Hi there,

I would love to apply a filter to only keep some specific groups after grouping by and aggregating.

model.objects.values(“model_field”).annotate(count_model_field=Count(“model_field”)).filter(model_field=some_value)

would apply the filtering before the annotations which is not the desired output. Any idea on how to achieve that without raw SQL ?

Thanks.

KenWhitesell · June 26, 2023, 12:12pm

I frequently find in these types of situations that it’s a lot better to address the specifics than trying to talk about it abstractly. It’s probably going to be easier if you provide a sample model and the results you’re trying to produce from that model.

ayyoubelamrani4 · June 26, 2023, 1:56pm

True, here is an example:

class Example:
profile = models.ForeignKey(Profile, on_delete=models.CASCADE)
score = models.IntegerField(null=True)
date = models.DateField()

I want to get the sum of the score over several days after grouping by profiles, generate a rank based on that aggregated score and then keep only the aggregated score/rank of a particular profile:

window = {“order_by”: F(“progression”).desc()}
Example.objects.values(“profile”).annotate(progression=Sum(“score”)).annotate(updated_rank=Window(expression=DenseRank(), **window)).filter(profile=my_profile)

In reality, the filter is applied before grouping by hence wrong aggregations and rankings.

KenWhitesell · June 26, 2023, 2:07pm

Questions based upon your sample:

What filter? I see no filter in this query.
What is “step_count”? That doesn’t seem to be defined here.

(The reason for asking for a complete sample is so that I can try to recreate what you’re seeing, with an eye toward experimenting a little to find a solution.)

ayyoubelamrani4 · June 26, 2023, 2:14pm

Sorry, just corrected that!

KenWhitesell · June 26, 2023, 4:57pm

Ok, I can see the issue that you’re reporting. I’m not immediately seeing any direct solution to this.

Couple more questions for you - what database engine are you using?

Do you already have the raw SQL that would return the desired results? (It might be easier to reverse-engineer the ORM query to match it.)

KenWhitesell · July 7, 2023, 11:59am

I suggest you open a new topic for your question. When you do, please include your model and a minimal example of what you’re trying to achieve.

srnangi · November 8, 2023, 8:24pm

Is this issue resolved? I am facing a similar problem. I want to filter based on the model field after filtering on the aggregate field. But, django is filtering the model field first and then the aggregate field leading to errors in the retrieved query set.

KenWhitesell · November 9, 2023, 4:07pm

I’m guessing the solution is going to involve some type of subquery expression here, but I’m still having a difficult time understanding what the objective is. It would be extremely helpful if you could provide a minimal code example, with data showing what the results you are looking for, and compared to the results you are getting with your current query.

Topic		Replies	Views
Sum with filter on related queryset Using the ORM	1	3495	September 1, 2022
How to Aggregate + Count? Using the ORM	1	1471	December 4, 2023
Aggregating a windowed queryset using the ORM Using the ORM	4	837	April 23, 2022
Django queryset filter with Rank Using the ORM	13	7188	November 3, 2022
Unable to use group by correctly ORM	4	3229	September 22, 2019

Filtering on model field after aggregating applies the filter first

Related topics