• Topic
  • Discussion
  • VOS.VirtEC2AMIBio2rdfInstall(Last) -- DAVWikiAdmin? , 2017-06-29 07:36:56 Edit WebDAV System Administrator 2017-06-29 07:36:56

    Prepopulated Virtuoso hosting of Bio2RDF on EC2 AMI

    What is the Bio2RDF Project?

    A Bio Informatics knowledgebase available in RDF based Linked Data form. For additional information see:

    Bio2RDF Installation on Virtuoso EC2 AMI instance

    OpenLink Software provides a backup up of the current Bio2RDF knowledgebase currently exposed at <http://lod.openlinksw.com/sparql>. Users can restore a backup of this knowledgebase into a Virtuoso AMI instance in the Amazon EC2 cloud thereby obtaining a personal or service-specific rendition of the Bio2RDF Knowledgebase.

    Installation

    1. Start a Virtuoso EC2 AMI instance. Note a Virtuoso Release 5 AMI instance ( ami-ids ami-59628630 or ami-c46084ad ) must be used with this backup. A 64-bit large image AMI instance with 8GB of memory, which is the m1.large EC2 instance type, is the minimum recommended for Bio2RDF. For best performance, particularly under extensive usage, the 16GB m1.xlarge EC2 instance type is preferred.



    2. Load the Virtuoso Conductor administration interface of the running EC2 AMI instance with a URL of the form http://your-ec2-instance-cname/conductor.



    3. From the Virtuoso Conductor, navigate to the "System Admin" -> "Packages" tab to obtain a list of available Virtuoso packages (VADs) to install.



    4. Click the "Install" button to initiate installation of the "EC2 Extensions" VAD package for use in performing backup and restore tasks.



    5. Click the "Proceed" button to install the "EC2 Extensions" VAD package.

    6. Go to the URL http://your-ec2-instance-cname/ec2exts to load the Virtuoso Extensions for Amazon EC2 Images login page and log in as the "dba" user.



    7. From the Virtuoso Extensions for Amazon EC2 Images main page, click the "Restore a Remote Backup" link.



    8. On the "Restore a Remote Backup" page, set the follow options —

      Protocol: WebDAV/HTTP Host: s3.amazonaws.com Path or Bucket: bio2rdf-virtuoso-bundle Backup File Prefix: bio2rdf-bundle

    9. Click the "Restore" button to begin the restoration of the Bio2RDF database from backup.


      .
      .
      .


    10. Click on the "Continue" button to begin the re-assembly of the database from the restored backup files. Output similar to the following will be displayed when the re-assembly of the database is complete.



    Once complete the server will have been restarted automatically with the restored Bio2RDF database and is ready for use.

    Usage Examples

    You can then access pages such as these on your Bio2RDF server:

      • http://your-ec2-instance-cname/interpro:IPR008543



      • http://your-ec2-instance-cname/geneid:4421783



      • http://your-ec2-instance-cname/uniprot:A0A358



    Bio2RDF VAD Application Package

    For those running a Bio2RDF Virtuoso EC2 AMI instance created before December 18, 2008, you will need to update the installed Bio2RDF VAD Application package to obtain the latest enhancements, by taking the following steps —

    1. Download the Bio2RDF VAD Application (bio2rdf_dav.vad) package.
    2. Navigate to the "System Admin" -> "Packages" tab of the Virtuoso Conductor.



    3. Scroll down to the "Install Package" section of the tab, use the "Upload Package" option "browse" button.



    4. Navigate to the location of the downloaded bio2rdf_dav.vad file and click the "open" button to select it.



    5. Click the "Proceed" button to begin the installation process.



    6. Once the VAD installation is complete, the Bio2RDF installation is up-to-date and ready for use.

    Results

    • The Bio2RDF SPARQL endpoint can be accessed on http://your-ec2-instance-cname/sparql
    • The OpenLink Interactive SPARQL Query Builder can be accessed on http://your-ec2-instance-cname/isparql, enabling the visual construction of queries (Graph Patterns).

    Related