Cannot scrape tags in a source url

  • #1221
    mike pan
    Participant

    Hello and happy new year.

    I am trying to scrape the source URL http://www.gazzetta.gr/basketball/euroleague/article/1431217/neo-stoihima-poy-kerdizei-o-pitino and the article itself scrapes fine, but I cannot find a way to scrape the tags of the articles on http://www.gazzetta.gr.
    Whichever approach I try, the result is always all the tags merged together without spaces. The XPaths I used are below.

    //div[@class='field-items']/div[@class='field-item even']/div[@class='item-list']/ul/li

    //div[@class='field-items']/div[@class='field-item even']/div[@class='item-list']/ul/li/a

    //div[@class='field field-name-article-tags field-type-ds field-label-hidden']/div[@class='field-items']/div[@class='field-item even']/div[@class='item-list']/ul/li/a

    /html/body/div[1]/div[1]/div[5]/div[3]/div[1]/div/div/div/div/div/div[2]/div/div[3]/div/div/div/ul/li/a

    Can you help me with this one?

    Best Regards

    Michael

    #1225
    Suman M.
    Keymaster

    Hi, thanks, and happy new year! Please use the following XPath for the Data Field type “Tags”, and also enable the “is Multiple” option so that each matched link is kept as a separate tag: https://www.screencast.com/t/5X7xsgygJ2C

    //div[contains(concat(" ", normalize-space(@class), " "), " item-list ")]/ul/li/a
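To illustrate why the tags come out merged, here is a minimal Python sketch using only the standard library. It is not how the plugin works internally. The sample markup, the tag names, and the helpers `has_class` and `tag_links` are all made up for the example, and the real page structure may differ. Because `xml.etree.ElementTree` does not support `contains()` in XPath, the class check is done in Python instead, as a whole-word match on the class attribute.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified markup; the real page's structure may differ.
SAMPLE = """
<html><body>
  <div class="field-items">
    <div class="field-item even">
      <div class="item-list extra">
        <ul>
          <li><a href="/tags/euroleague">Euroleague</a></li>
          <li><a href="/tags/pitino">Pitino</a></li>
          <li><a href="/tags/basketball">Basketball</a></li>
        </ul>
      </div>
    </div>
  </div>
</body></html>
"""


def has_class(element, name):
    """Return True if `name` is one of the element's space-separated classes."""
    return name in element.get("class", "").split()


def tag_links(root):
    """Collect every ul/li/a under a div that carries the item-list class."""
    links = []
    for div in root.iter("div"):
        if has_class(div, "item-list"):
            links.extend(div.findall("./ul/li/a"))
    return links


root = ET.fromstring(SAMPLE)
links = tag_links(root)

# Reading all matches as one value glues the tag names together.
merged = "".join(link.text.strip() for link in links)

# Keeping one value per matched node gives separate tags.
separate = [link.text.strip() for link in links]

print(merged)
print(separate)
```

The first print shows the tags run together, which is the symptom described above. The second keeps one entry per matched link, which is roughly what a multiple-value field is meant to give you.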
