-
Notifications
You must be signed in to change notification settings - Fork 130
(Probable Playwright Request Overridden issue) Scrapy-playwright doesnot seem to work on website "njcourts" but playwright works #199
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I dug deeper into the issue and found the similar issue/bug in #100 and it seems to be closed after new pull request https://github.com/scrapy-plugins/scrapy-playwright/pull/144/files. But I think the issue still is not resolved.
|
I checked the har using browser context
The original request should have been POST, but with playwright-scrapy, the request is shown as GET. This is the result of this bug. Is there a way to just not modify the request with playwright-scrapy or is this something necessary for the library to work? I will appreciate it if you can point me in the right direction and let me know if this is the real issue. I feel like something is wrong in this conditional and we can just change the request if it is scrapy.Request, else is it necessary to change the request method? I will love to hear why this decision was made. Thank you, |
The code you mentioned in your comment was updated in #177 and has not been released yet. It's likely that it will actually solve your issue, I suspect that your POST request is probably not a navigation request, so it will not trigger the block that overrides the method. |
Description
I am trying to scrape the website "https://portal.njcourts.gov/webe40/JudgmentWeb/jsp/judgmentSearch.faces" but I cannot proceed any further than the homepage using scrapy_playwright but can do all operations with Playwright. If I click on any of the navigation tabs or click search, I get redirected to the page attached in the image.[the URL is the same as above]. This is not the issue of website blocking us as I can make this work using playwright as soon below.

Steps to Reproduce
Scrapy-Playwright Code
Vanilla Playwright code
Versions
Additional Information
The site seems to only work for American IPs.
If you cannot reproduce the issue or need more information, please let me know. I will appreciate a lot if you can point me in the right direction from here.
Thank you,
Binit
The text was updated successfully, but these errors were encountered: