We're Moving!

The Vertica Forum is moving to a new OpenText Analytics Database (Vertica) Community.

Join us there to post discussion topics, learn about

product releases, share tips, access the blog, and much more.

Create My New Community Account Now


Loading Genome data from sequencing system, illumine HiSeq Xs&2500. — Vertica Forum

Loading Genome data from sequencing system, illumine HiSeq Xs&2500.

I am pursuing a genomic analysis opportunity. My question is how Vertica can get the result of genomic analysis from a sequencing system. Do we need to use any ETL tool or any interface to get the data from them?

Comments

  • MosheMoshe - Select Field -

    No need for ETL tools. One can directly load the data with Vertica COPY command.
    You can run HiSeq analysis software under Linux to produce CSV files.
    In addition the HiSeq analysis software can run on a cluster using a shell script
    that will submit the analysis job to a queue manager.
    See: https://hpc.nih.gov/docs/hiseq.pdf
    Note that the amount of genomic analysis output data in total is not much.

Sign In or Register to comment.