Why YOLO model loading for every call and how can I solve?

iskenderk · April 15, 2023, 4:02pm

Hello,

I am working on a project that uses Django as an API.
I wrote two functions in views.py.
The first function is for the POST method.
This function, taking an image url, downloads and saves to the database.
And the second function is for the GET method.
This function, getting the image from db, segments with the YOLOv5 model, and returns the results.

But every call of the second function is loading the YOLOv5 model again.
Therefore, I take Cuda Out Of Memory the second or third call.
How can I load the model only once and use it every call?

Thanks for your help!

KenWhitesell · April 15, 2023, 4:13pm

You don’t - at least not within Django itself.

If you want some form of memory-resident model, you’re looking at perhaps something like a custom management command that is always running and exchanges messages with Django as needed.

However you do it, you want to segment your architecture between the persistent models being run and the request/response cycle provided by Django.

DanielGnzlzVll · April 16, 2023, 9:27pm

hi,

I suppose you have your model loading code in the view, you need to move it out of the view to the imports part, so it would be execute once and the start up and will persist for successive requests.

iskenderk · April 18, 2023, 9:20am

Hi Ken,
Is there not any way to do this in Django?
I can use the YOLO model in Django, for live streaming and it works.
But in this case, I need the POST and GET method.
Is the problem the POST and GET methods?

iskenderk · April 18, 2023, 9:40am

Hi Daniel,
I tried this, I loaded the model in another file and import it, but it is still same.
It works when it runs at port, but when it runs at UWSGI, loading the model for every call.

KenWhitesell · April 18, 2023, 11:41am

Django is fundamentally built around the idea of the “request / response” cycle. Objects are created when the request is received, and disposed when the response is returned.

In a production-quality deployment of Django, you also have multiple processes running. There’s no such thing as “sharing an object between processes in memory”. Additionally, the process manager will, based upon circumstances, restart any individual process.

If you want “persistent entities” between requests in a production Django environment, you want them outside the Django process.

That’s why, for example, each of Celery and Celery Beat are run as separate processes.

Topic		Replies	Views
Create globally available objects based on a model data Forms & APIs	5	1202	December 13, 2022
Object detection over the web using yolov5 with django web framework Mentorship	1	1397	May 12, 2022
no update in the view Getting Started	2	253	January 23, 2023
Images not loading correctly Using Django	5	2399	August 20, 2021
Deploy TensorFlow model Using Django	4	3702	April 22, 2020

Why YOLO model loading for every call and how can I solve?

Related topics