Virtuoso Open-Source Wiki
Virtuoso Open-Source, OpenLink Data Spaces, and OpenLink Ajax Toolkit
Advanced Search
Location: / Dashboard / Main / VOSIndex / VirtSetCrawlerJobsGuide / VirtCrawlerGuideAtom

Virtuoso Crawler Guide for populating Virtuoso Quad Store using ATOM feed


This Guide demonstrates populating the Virtuoso Quad Store using ATOM feed.


Populating the Virtuoso Quad Store can be done in different ways Virtuoso supports. The Conductor -> Content Import UI offers plenty of options, one of which is the XPath expression for crawling RDF resources URLs and this feature is a powerful and easy-to-use for managing the Quad Store.


To populate the Virtuoso Quad Store, in this Guide we will use a XPAth expression for the URLs of the RDF resources references in a given ATOM feed. For ex. this one of the "National Bibliography" Store.

Sample Scenario

  1. Go to http://cname/conductor
  2. Enter dba credentials
  3. Go to Web Application Server -> Content Management -> Content Imports:

  4. Click "New Target":

  5. In the presented form specify respectively:
    • Target description: for ex. National Bibliography ;
    • Target URL: for ex. ;
      • Note: the entered URL will be the graph URI for storing the imported RDF data. You can also set it explicitly by entering another graph URI in the "If Graph IRI is unassigned use this Data Source URL:" option.
    • Copy to local DAV collection: for ex.


    • XPath expression for links extraction:


    • Update Interval (minutes): for ex. 10 ;
    • Run Sponger: hatch this check-box ;
    • Accept RDF: hatch this check-box ;
    • Store metadata: hatch this check-box ;
    • RDF Cartridge: hatch this check-box and specify what cartridges will be used:

  6. Click "Create":
  7. The new created target should be displayed in the list of available Targets:

  8. Click "Import Queues":

  9. Click for "National Bibliography" target the "Run" link from the very-right "Action" column:
  10. Should be presented list of Top pending URLs:

  11. Go to "Retrieved Sites" to check the total URLs that were processed:

  12. To view the imported RDF data, go to http://cname/sparql and enter a simple query for ex.:

    SELECT * FROM <> WHERE { ?s ?p ?o }

  13. Click "Run Query".
  14. The imported RDF data triples should be shown:


Powered By Virtuoso