Loading genome data from a sequencing system (Illumina HiSeq X and 2500)
Kaito
Employee
I am pursuing a genomic analysis opportunity. My question is how Vertica can get the results of genomic analysis from a sequencing system. Do we need an ETL tool or some other interface to get the data from it?
Comments
No ETL tool is needed. You can load the data directly with the Vertica COPY command.
Run the HiSeq analysis software under Linux to produce CSV files, then load those files with COPY.
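As a rough illustration of the COPY step, here is a minimal Python sketch that builds a COPY statement for one CSV file. The table name `genomics.variant_calls` and the file path are hypothetical placeholders, and the sketch assumes the CSV is comma-delimited with a single header row; adjust these to match what your analysis software actually writes.

```python
def build_copy_statement(table, csv_path, delimiter=",", skip_header=True):
    """Return a Vertica COPY statement that loads one local CSV file.

    The table name is inserted as-is, so pass only trusted identifiers.
    """
    # Double any single quotes so the values are valid SQL string literals.
    path_literal = csv_path.replace("'", "''")
    delim_literal = delimiter.replace("'", "''")

    parts = [
        f"COPY {table}",
        f"FROM LOCAL '{path_literal}'",  # file is read from the client machine
        f"DELIMITER '{delim_literal}'",
    ]
    if skip_header:
        parts.append("SKIP 1")  # ignore the CSV header row
    return " ".join(parts) + ";"


if __name__ == "__main__":
    # Hypothetical table and path, for illustration only.
    print(build_copy_statement("genomics.variant_calls",
                               "/data/hiseq/run_001/variants.csv"))
```

The generated statement can then be run through vsql or any Vertica client driver. The target table has to exist already, with columns that match the CSV layout.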
In addition, the HiSeq analysis software can run on a cluster, using a shell script
that submits the analysis job to a queue manager.
See: https://hpc.nih.gov/docs/hiseq.pdf
Note that the total volume of genomic analysis output data is not large.