Macquarie University ResearchOnline

Please use this identifier to cite or link to this item: http://hdl.handle.net/1959.14/102357

Title
Identifying change-points in biological sequences via sequential importance sampling
Related
Environmental modeling and assessment, Vol. 14, Issue 5 (2009), p.577-584
DOI
10.1007/s10666-008-9160-8
Publisher
Springer
Date
2009
FoR/RFCD Code(s)
010000 Mathematical Sciences; 050000 Environmental Sciences; 080000 Information And Computing Sciences
Author/Creator
Sofronov, George Yu
Author/Creator
Evans, Gareth E
Author/Creator
Keith, Jonathan M
Author/Creator
Kroese, Dirk P
Description
The genomes of complex organisms, including the human genome, are highly structured. This structure takes the form of segmental patterns of variation in various properties and may be caused by the division of genomes into regions of distinct function, by the contingent evolutionary processes that gave rise to genomes, or by a combination of both. Whatever the cause, identifying the change-points between segments is potentially important, as a means of discovering the functional components of a genome, understanding the evolutionary processes involved, and fully describing genomic architecture. One property of genomes that is known to display a segmental pattern of variation is GC content. The GC content of a portion of DNA is the proportion of GC pairs that it contains. Sharp changes in GC content can be observed in human and other genomes. Such change-points may be the boundaries of functional elements or may play a structural role. We model genome sequences as a multiple change-point process, that is, a process in which sequential data are separated into segments by an unknown number of change-points, with each segment supposed to have been generated by a different process. We consider a Sequential Importance Sampling approach to change-point modeling using Monte Carlo simulation to find estimates of change-points as well as parameters of the process on each segment. Numerical experiments illustrate the effectiveness of the approach. We obtain estimates for the locations of change-points in artificially generated sequences and compare the accuracy of these estimates to those obtained via Markov chain Monte Carlo and a well-known method, IsoFinder. We also provide examples with real data sets to illustrate the usefulness of this method.
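The definition of GC content and the idea of a segmented sequence can be illustrated with a minimal sketch. It is not the authors' Sequential Importance Sampling method and it estimates nothing: it only computes GC content as the proportion of G or C bases in a single-strand string (which equals the proportion of GC pairs in the double strand) and generates an artificial sequence whose GC probability changes at chosen positions. The function names `gc_content`, `simulate_segments`, and `windowed_gc`, and all parameter values, are hypothetical and chosen for illustration.

```python
import random


def gc_content(seq):
    """Proportion of G or C bases in a DNA string (0.0 for an empty string)."""
    if not seq:
        return 0.0
    seq = seq.upper()
    return sum(base in "GC" for base in seq) / len(seq)


def simulate_segments(change_points, gc_probs, length, seed=0):
    """Generate an artificial sequence with one GC probability per segment.

    change_points: sorted positions where a new segment starts (assumed input).
    gc_probs: GC probability for each segment, one more than the change-points.
    """
    if len(gc_probs) != len(change_points) + 1:
        raise ValueError("need exactly one GC probability per segment")
    rng = random.Random(seed)
    bounds = [0, *change_points, length]
    bases = []
    for start, end, p in zip(bounds, bounds[1:], gc_probs):
        for _ in range(end - start):
            # Draw G/C with probability p, otherwise A/T.
            bases.append(rng.choice("GC") if rng.random() < p else rng.choice("AT"))
    return "".join(bases)


def windowed_gc(seq, window):
    """GC content of consecutive non-overlapping windows of the given size."""
    return [
        gc_content(seq[i:i + window])
        for i in range(0, len(seq) - window + 1, window)
    ]


if __name__ == "__main__":
    # One change-point at position 500: GC-poor segment, then GC-rich segment.
    seq = simulate_segments([500], [0.2, 0.8], 1000)
    for index, value in enumerate(windowed_gc(seq, 100)):
        print(index * 100, round(value, 2))
```

Printing the windowed values for such a sequence shows the kind of sharp shift in GC content that a change-point method would aim to locate.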
Description
8 page(s)
Subject Keyword
comparative genomics
Subject Keyword
multiple change-point problem
Resource Type
journal article
Organisation
Macquarie University. Dept. of Statistics

Identifier
http://hdl.handle.net/1959.14/102357
Identifier
ISSN:1420-2026
Identifier
mq-rm-2009010594
Language
eng
Reviewed
Reviewed