
A Cloud Infrastructure for Optimization of a Massive Parallel Sequencing Workflow

Platform: .NET

IEEE Project Year: 2012-13

Abstract:

Massive parallel sequencing is a term for several revolutionary approaches to DNA sequencing, collectively called Next Generation Sequencing (NGS) technologies. These technologies generate millions of short sequence fragments in a single run and can be used to measure gene expression levels and to identify novel splice variants of genes, allowing more accurate analysis. The proposed solution is novel in two respects. First, the read mapping algorithm has been optimized so that its processes can run in parallel. Second, an architecture has been implemented that consists of a Grid platform composed of physical nodes, a Virtual platform composed of virtual nodes set up on demand, and a scheduler that integrates the two platforms.
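
The interplay between the two platforms can be pictured with a minimal sketch. It is an illustration only, not the project's scheduler: the class name HybridScheduler, the node names, and the policy "use a free physical node first, otherwise start a virtual node on demand" are all assumptions. Job completion and virtual node teardown are left out.

```python
from dataclasses import dataclass, field


@dataclass
class HybridScheduler:
    """Hypothetical scheduler spanning a Grid platform and a Virtual platform."""

    grid_nodes: list[str]          # physical nodes that always exist
    max_virtual: int               # upper bound on virtual nodes set up on demand
    assignments: dict[str, str] = field(default_factory=dict)  # job -> node
    virtual_started: int = 0       # how many virtual nodes have been created

    def submit(self, job: str) -> str:
        """Place a job on a free physical node, else on a new virtual node."""
        busy = set(self.assignments.values())
        for node in self.grid_nodes:
            if node not in busy:
                self.assignments[job] = node
                return node
        # No physical node is free: set up a virtual node on demand.
        if self.virtual_started >= self.max_virtual:
            raise RuntimeError("no capacity left on either platform")
        self.virtual_started += 1
        node = f"vm-{self.virtual_started}"
        self.assignments[job] = node
        return node


# Assumed setup: two physical nodes, at most two virtual nodes, four segments.
sched = HybridScheduler(grid_nodes=["grid-1", "grid-2"], max_virtual=2)
placements = [sched.submit(f"segment-{i}") for i in range(1, 5)]
print(placements)
```

In this sketch the first two segments land on the physical nodes and the remaining two trigger virtual nodes, which is one simple way a single scheduler could present both platforms as one pool.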

 

 

 

Existing System:

The input consists of two files of about 3 GB each; a second group consists of twelve files of about 500 MB each, and the support files are about 4 GB each. In the original version of TopHat, processing a single sample requires about 8 GB of RAM and at least 60 GB of free hard disk space. Figure 2 depicts the processing time of the entire TopHat flow for a single sample as the number of nodes increases. Processing with a single node corresponds to the original, sequential version of TopHat. Note the processing time when 3, 4, or 5 nodes are available: no time is gained, because each node still has more than one segment to process.
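
The plateau at 3, 4, and 5 nodes can be illustrated with a small calculation. The sketch below assumes, purely for illustration, that a sample is split into six equal segments and that each node processes its segments one after another; the segment count and the function name rounds_needed are not taken from the source.

```python
import math


def rounds_needed(segments: int, nodes: int) -> int:
    """Sequential rounds required when segments are spread evenly over nodes."""
    if segments < 0 or nodes < 1:
        raise ValueError("segments must be >= 0 and nodes must be >= 1")
    # The busiest node holds ceil(segments / nodes) segments and sets the pace.
    return math.ceil(segments / nodes)


SEGMENTS = 6  # assumed value for illustration only

for nodes in range(1, SEGMENTS + 1):
    print(f"{nodes} node(s): {rounds_needed(SEGMENTS, nodes)} round(s)")
```

Under these assumptions, 3, 4, and 5 nodes all need two rounds, so adding a fourth or fifth node does not shorten the run; only a node count that leaves one segment per node would.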

 

 

 

Proposed System:

The purpose is to offer biologists a friendly infrastructure for conducting their research and to respond to the ever-evolving needs of users of Next Generation Sequencing (NGS) technologies. Biologists already use the Amazon Elastic Cloud Computing infrastructure for their research, but in some contexts it is preferable to run several instances of a tailored virtual machine rather than submit jobs to one's own existing infrastructure. The system reduces processing time even for a single sample. Future work includes improving the scheduling policies to balance the number of jobs against the available resources. This study also opens the way to a multi-sample scenario, in which several samples are processed simultaneously.

HARDWARE & SOFTWARE REQUIREMENTS:

HARDWARE REQUIREMENTS:

Processor        : Intel Pentium IV
Speed            : 1.6 GHz
RAM              : 2 GB
Hard Disk        : 500 GB
General          : Keyboard, Monitor, Mouse

SOFTWARE REQUIREMENTS:

Operating System : Windows XP or Windows 7
Software         : Visual Studio .NET 2008, SQL Server Tools 2005