Start scraping from certain paragraph

Viewing 4 posts - 1 through 4 (of 4 total)
  • Author
    Posts
  • #1945
    Ioannis MavroudisIoannis Mavroudis
    Participant

    Hello, I have purchased this fantastic plugin. I want to parse multiple posts from wikidoc.com.
    I do not want to scrape the table of contents, and I want the scraper to start from a certain paragraph and below.

    Thanks

    #1946
    Suman M.Suman M.
    Keymaster

    Hi, thanks! Can you please let us know what exactly you want to scrape and from which source url exactly? Screenshot will be helpful. Thanks!

    #1948
    Ioannis MavroudisIoannis Mavroudis
    Participant

    Hi,
    Thank you very much for your prompt response. I want to scrape content from a list urls, one of them is https://www.wikidoc.org/index.php/Glycogen_storage_disease_type_V.
    I want to scrape everything below Overview. When I use the scraper it takes everything, including the table of contents on the right which I do not want.

    Best Regards

    #1951
    Suman M.Suman M.
    Keymaster

    I checked it and because of the HTML structure in this site, the content without table of content cannot be scraped. You’ll need to filter out the unwanted content in your site’s frontend. You can either remove it using code before displaying or else use CSS like below (in theme’s stylesheet) to hide it.

    table.infobox { display:none !important; }

    Thanks & Regards!

Viewing 4 posts - 1 through 4 (of 4 total)

You must be logged in and have valid license to reply to this topic.

License required for the following item
Login and Registration Log in · Register