Amazon webscraping mysteriously stops working when used in django

Daniel1464 · September 16, 2024, 3:18am

Hi guys,

I’ve been facing a pretty perplexing issue regarding web scraping w/ a django web server.

You see, I’ve coded a successful async method that uses aiohttp to scrape amazon.com. When I run this async method in a regular main() method, it seems to work. However, whenever I run it using django, the second request always seems to get caught by amazon.

I’m using the aiohttp client session getter referenced here:

Can anyone help me out here? Thanks!

anefta · September 16, 2024, 12:00pm

Can you post your code? Which part exactly does not work?

KenWhitesell · September 16, 2024, 12:30pm

Welcome @Daniel1464 !

Additionally, please clarify what you mean by:

What exactly is happening?

Daniel1464 · September 16, 2024, 3:03pm

Alright, lemme try and give a clearer explanation(sorry about that).

Basically, I am using a django server to scrape amazon.

I have an async function that takes a link and scrapes the amazon website for the ingredients of that page.

This works 100% of the time when I run these functions together using asyncio.gather().

However, when I try to run the asyncio.gather() method within a django view, the first link will always work, but the second link always fails(now, it simply returns a “not found” page even though I literally copy-pasted the link from my browser).

My question is that if there’s any ASGI configuration that could help fix this. Thanks!

anefta · September 18, 2024, 10:21pm

But what is your current code? What is your current configuration?

Topic		Replies	Views
cannot request get/post to external api with asgi mode Using Django	5	2335	September 7, 2021
Django hangs on async views with asycio.gather and an async ORM call Async/Channels	3	1949	October 17, 2023
How should i write a view using Django 3 and async/await? Using Django	4	5973	September 15, 2019
In django asynchronous view, multiple consecutive requests will not run? Using Django	6	2346	June 24, 2021
Multiple Daemon threads getting spawned in ASGI protocol with daphne server Async/Channels	1	677	September 21, 2024

Amazon webscraping mysteriously stops working when used in django

Related topics