Welcome to openkapow Sign in | Join
in Search
.

How to build a Robot which can extract text from repeated HTML block?

Last post 07-07-2008, 5:41 AM by tomelf. 6 replies.
Sort Posts: Previous Next
  •  07-02-2008, 8:10 PM 20285

    How to build a Robot which can extract text from repeated HTML block?

    Hi!

    Does anyone know how to extract text from repeated HTML block? For example, this site has 8 school information, can I get all the "School Name", "School Address" without labeling all the name and address by hand? Can we apply some extraction patterns when using Robomaker?

    Thanks very much!

    Filed under: ,
  •  07-02-2008, 10:57 PM 20288 in reply to 20285

    Re: How to build a Robot which can extract text from repeated HTML block?

    Hi tomelf,

    Yes you can extract all school informations by using for each tag....

     


    K.Palanikumar
  •  07-03-2008, 6:49 AM 20300 in reply to 20288

    Re: How to build a Robot which can extract text from repeated HTML block?

    Here's a sample

     

    http://openkapow.com/files/folders/tutorials_and_examples/entry20298.aspx

     

    You have to find the right tag for the loop.

  •  07-03-2008, 9:12 PM 20307 in reply to 20285

    Re: How to build a Robot which can extract text from repeated HTML block?

    Thanks you all! It works!
  •  07-03-2008, 9:36 PM 20308 in reply to 20285

    Re: How to build a Robot which can extract text from repeated HTML block?

    And one more question...If the HTML structure in the page (slightly) changed, Can Robomaker detect this change? Or I should manually fix my robot and make it adapt the new version page?
  •  07-04-2008, 3:49 PM 20320 in reply to 20308

    Re: How to build a Robot which can extract text from repeated HTML block?

    You have to do a manual fix.

    Use an rss feed to detect runerrors:

    http://service.openkapow.com/claus/schooldata.info

    To make a robot more robust you could use the "find -tag" step to set the current tag but this is more work.

    .

     

     

  •  07-07-2008, 5:41 AM 20335 in reply to 20320

    Re: How to build a Robot which can extract text from repeated HTML block?

    Thanks for your answer! I'll try this way!
View as RSS news feed in XML
.
Copyright 2006, 2007 KapowTech.com All Rights Reserved Company | Contact | Terms | Privacy