| |
|
2013/04/15, MC103, 12 - 12:30
A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotides distribution
Vladimir
Reinharz
, McGill SOCS
Area:
Bioinformatics, RNA structure prediction and sequence design
Abstract:
The design of RNA sequences folding into predefined secondary structures is a
milestone for many synthetic biology and gene therapy studies. Most of the current
software uses similar local search strategies (i.e. a random seed is progressively
adapted to acquire the desired folding properties) and more importantly do not
allow the user to control explicitly the nucleotide distribution such as the
GC-content in their sequences. However, the latter is an important criterion for
large-scale applications as it could presumably be used to design sequences with
better transcription rates and/or structural plasticity.
We introduce a novel algorithm to design RNA sequences folding into target
secondary structures with a predefined nucleotide distribution. It uses a global
sampling approach and weighted sampling techniques. We show that our approach is
fast (i.e. running time comparable or better than local search methods), seed-less
(we remove the bias of the seed in local search heuristics), and successfully
generates high-quality sequences (i.e. thermodynamically stable) for any GC-
content. To complete this study, we develop an hybrid method combining our global
sampling approach with local search strategies. Remarkably, our glocal methodology
overcomes both local and global approaches for sampling sequences with a specific GC
content and target structure.
|
|
|