Join Discord
Powered by
Handling Dynamic Links with Crawlee PlaywrightCraw...
# crawlee-js
b
boatbxy
01/14/2025, 5:34 PM
I’m working on a project using PlaywrightCrawler to scrape links from a dynamic JavaScript-rendered website. The challenge is that the
tags don’t have href attributes, so I need to click on them and capture the resulting URLs. - Delayed Link Rendering: Links are dynamically rendered with JavaScript, often taking time due to a loader. How can I ensure all links are loaded before clicking? - Navigation Issues: Some links don’t navigate as expected or fail when trying to open in a new context. - Memory Overload: I get the warning "Memory is critically overloaded" during crawls I've attached images of my code (it was too long so I couldn't paste it) How can I handle these issues more efficiently, especially for dynamic and JavaScript-heavy sites? I would appreciate any help
https://cdn.discordapp.com/attachments/1328779272000639119/1328779272353218572/image.png?ex=6787f1a2&is=6786a022&hm=f809baa35a830f152ffe2b99ebc073e91bee15117d81fa53493cd8724507fa78&
https://cdn.discordapp.com/attachments/1328779272000639119/1328779273321975828/image.png?ex=6787f1a2&is=6786a022&hm=6567b11df2bcfe5f51482923712ad73dee84f6699d8632feb2b95a01d707731a&
h
Hall
01/14/2025, 5:34 PM
Someone will reply to you shortly. In the meantime, this might help:
a
Alexey Udovydchenko
01/15/2025, 3:42 AM
First and foremost allocate more RAM, then, with enough memory, performance might be good enough to avoid navigation and loading issues
b
boatbxy
01/15/2025, 7:28 AM
Alright, thank you!
Previous
Next