Unable to launch a task, with multiple posts
wpBots Support – The best crawlers for WordPress › Forums › SCRAPER (after-sales) › Tasks Troubleshooting › Unable to launch a task, with multiple posts
- This topic has 17 replies, 2 voices, and was last updated 4 years, 7 months ago by Suman M..
-
AuthorPosts
-
March 25, 2020 at 12:50 pm #1721herve fParticipant
HI,
1 / I have an error message:
Errors enabled, full PHP dump output log with all error and warnings Array{"success":true,"results":{"source_connection":true,"collected_urls":["https:\/\/www.amazon.fr\/joueur-fl%C3%BBte-Hamelin-Lisbeth-Zwerger\/dp\/2354130627\/ref=sr_1_1?keywords=Actes+sud&link_code=qs&qid=1585140209&rnid=301130&s=books&sourceid=Mozilla-search&sr=1-1","","","","","","","","","","","","","","","","","","","","","",""],"http_status_code":200,"last_index":"22","next_page_path_defined":false,"insert_error":[]}} HTTP Code : 200
2 / I duplicated a task by copying a new url from Amazon (list of books from an editor). It puts the same error on me.
I hope what I did is correct and allows me to recover any list of books from Amazon?A task with the same setting but a single post works well
Regards
March 25, 2020 at 1:50 pm #1723Suman M.KeymasterHi, it’s actually an error log. And there, HTTP Code is 200, which means that the scraping is working fine. Please try disabling “error reporting” from License and settings tab. Let us know.
March 25, 2020 at 2:39 pm #1725herve fParticipantHi,
1 / I have already disabled it! (see image)
2 / ok is that the right method?
RegardsAttachments:
You must be logged in to view attached files.March 26, 2020 at 5:51 am #1731Suman M.KeymasterHi, can you please let us know your site’s backend login details (as private reply) so that we can look into the issue? And let us know the task ID that’s having issue. Thanks!
March 26, 2020 at 9:10 am #1732herve fParticipantThis reply has been marked as private.March 29, 2020 at 8:18 am #1739Suman M.KeymasterThis reply has been marked as private.March 29, 2020 at 2:02 pm #1741herve fParticipantHI,
I just modified the “Authors” field a little, but I have already observed this same error message.
Then I copied the task and I changed the url (list of all the books of an editor). Doesn’t this pose a problem?I also disabled other plugins to see if there was a conflict.
RegardsMarch 31, 2020 at 6:24 am #1746Suman M.KeymasterThis reply has been marked as private.March 31, 2020 at 8:17 am #1748herve fParticipantHi
Ah very sad 🙁
I hope you can find a workaround idea because it’s all the main use of the site / plugin that is no longer useful
RegardsMarch 31, 2020 at 8:21 am #1749herve fParticipant1/ Even by getting a few books per hour ?
March 31, 2020 at 1:26 pm #1752Suman M.KeymasterThis reply has been marked as private.March 31, 2020 at 1:44 pm #1755herve fParticipantHi,
I am very disappointed :-((
I spent days testing plugins and then configuring exactly. This was so far the best.
I imagine that it must be complicated to find a solution for you but I have no other solution than to hope that you find a workaround (stealth mode simulating a human being with a web browser).
Other sites will do the same and I guess I’m not the only one with sites like Amazon
RegardsApril 1, 2020 at 11:49 am #1758Suman M.KeymasterIt seems to be working fine now. I needed to change XPath of Item’s Path. Can you please check and let us know? Thanks!
April 1, 2020 at 12:39 pm #1759herve fParticipantHi,
Great it works again 🙂
I did not understand if amazon blocked or if you had to make a modification of your program or Amazon only?I also tested amazon.fr 1 herve – Copy
Task ID: e26d1a6b3334f5dc47f7bc9d17a532d2
which is the copy of the 1st task.
I’m happy because it works too, even having copied a new url, which lists the books of a publisher!
I need clarification because1 / I found myself with a duplicate after having reset a task.
Once the task is properly set. I no longer need to reset to avoid recovering duplicates. Did I understand right?2 / I have trouble finding the right number to recover.
Would the following settings to retrieve a number of items be generally correct
a) Manual action (ex: amazon)
Schedule: Interval & process delay = “not defined”
Limits: Loop = “20” & maximum limit = “9999”b) Automatic action on a blog
Schedule: Interval = “every day” & process delay = “5mn”
Limits: Loop = “2” & maximum limit = “9999”Regards
April 2, 2020 at 9:02 am #1761herve fParticipantHI,
I had a problem on the site :-(.
I hadn’t backed up yet. In order not to lose what you have done, can you answer my previous questions by specifying if your modifications were made in files of your extension (see stored in a database)?
Regards -
AuthorPosts
You must be logged in and have valid license to reply to this topic.