Loading genome data from Illumina HiSeq X and 2500 sequencing systems

I am pursuing a genomic analysis opportunity. How can Vertica ingest the results of genomic analysis from a sequencing system? Do we need an ETL tool or some other interface to get the data out of the sequencer?


  • Moshe

    No ETL tool is needed. You can load the data directly with the Vertica COPY command.
    The HiSeq analysis software runs under Linux and can produce CSV files.
    It can also run on a cluster, using a shell script
    that submits the analysis job to a queue manager.
    See: https://hpc.nih.gov/docs/hiseq.pdf
    Note that the total volume of genomic analysis output is not large.
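
To make the COPY step concrete, here is a minimal sketch that builds a COPY statement for one CSV file produced by the analysis software. The table name `genomics.variant_calls`, the file path, and the helper `build_copy_statement` are all hypothetical; the thread does not say what the CSV files contain, so the sketch assumes a header row and comma-separated fields. `FROM LOCAL` reads the file from the client machine; drop `LOCAL` if the file already sits on a Vertica node.

```python
import re


def build_copy_statement(table, csv_path, delimiter=",", skip_header=True):
    """Return a Vertica COPY statement that loads one CSV file into `table`.

    Hypothetical helper for illustration; it only builds the SQL text.
    """
    # Allow only plain or schema-qualified identifiers, because the table
    # name is interpolated directly into the SQL text.
    if not re.fullmatch(r"[A-Za-z_][A-Za-z0-9_]*(\.[A-Za-z_][A-Za-z0-9_]*)?", table):
        raise ValueError(f"unexpected table name: {table!r}")
    if len(delimiter) != 1 or delimiter == "'":
        raise ValueError("delimiter must be a single character other than a quote")
    # Double any single quotes so the path stays a valid SQL string literal.
    path = csv_path.replace("'", "''")
    parts = [f"COPY {table} FROM LOCAL '{path}'", f"DELIMITER '{delimiter}'"]
    if skip_header:
        # Assumption: the CSV starts with one header row.
        parts.append("SKIP 1")
    return " ".join(parts) + ";"


# Hypothetical table and path, chosen only for the example.
statement = build_copy_statement(
    "genomics.variant_calls", "/data/hiseq/run01_variants.csv"
)
print(statement)
```

The printed statement can be pasted into vsql or passed to whatever client library you already use. The target table has to exist beforehand, with columns matching the CSV layout.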
