
Macquarie University ResearchOnline


Please use this identifier to cite or link to this item: http://hdl.handle.net/1959.14/89070

File | Description | Size | Format
DS01 | Publisher version (open access) | 1 MB | Adobe Acrobat PDF
Title
3PFDB - a database of Best Representative PSSM Profiles (BRPs) of protein families generated using a novel data mining approach
Related
BioData mining, Vol. 2, Issue 8 (2009), p.1-10
DOI
10.1186/1756-0381-2-8
Publisher
BioMed Central
Date
2009
Author/Creator
Shameer, Khader
Author/Creator
Nagarajan, Paramasivam
Author/Creator
Kumar, Gaurav
Author/Creator
Sowdhamini, Ramanathan
Description
Background: Protein families can be related to each other at broad levels that group them into superfamilies. These relationships are harder to detect at the sequence level because of high evolutionary divergence. Sequence searches are strongly directed and influenced by the best representatives of families, which serve as starting points. PSSMs are useful approximations and mathematical representations of protein alignments, with a wide array of applications in bioinformatics approaches such as remote homology detection, protein family analysis, detection of new members and evolutionary modelling. Computationally intensive searches have been performed using the neural network based sensitive sequence search method called FASSM to identify the Best Representative PSSMs for families reported in the Pfam database (version 22).
Results: We designed a novel data mining approach for the assessment of individual sequences from a protein family to identify a single Best Representative PSSM profile (BRP) per protein family. Using this approach, a database of protein family-specific best representative PSSM profiles, called 3PFDB, has been developed. PSSM profiles in 3PFDB are curated using the performance of each individual sequence as a reference in a rigorous scoring and coverage analysis with FASSM. We have assessed the suitability of 1,085,588 sequences derived from seed or full alignments reported in the Pfam database (version 22). Coverage analysis using the FASSM method serves as the filtering step to identify the best representative sequence, starting from full-length or domain sequences, to generate the final profile for a given family. 3PFDB is a collection of best representative PSSM profiles of 8,524 protein families from the Pfam database.
Conclusion: The availability of an approach to identify BRPs, together with a curated database of best representative PSI-BLAST derived PSSMs for 91.4% of current Pfam families, will be a useful resource for the community to perform detailed and specific analyses using family-specific, best-representative PSSM profiles. 3PFDB can be accessed using the URL: http://caps.ncbs.res.in/3pfdb
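The coverage-based selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline and does not use FASSM or PSI-BLAST: the function `toy_hits` is a hypothetical stand-in that treats a family member as "found" when its position-wise identity to the query reaches an assumed threshold of 0.5, and the four sequences are invented toy data. The sketch only shows the general idea of scoring every member by how much of its family it recovers and keeping the one with the highest coverage.

```python
from typing import Callable, Dict, Set, Tuple


def identity(a: str, b: str) -> float:
    """Fraction of identical positions over the shorter sequence (toy measure)."""
    n = min(len(a), len(b))
    if n == 0:
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / n


def toy_hits(query: str, family: Dict[str, str], threshold: float = 0.5) -> Set[str]:
    """Hypothetical stand-in for a profile search (assumption, not FASSM):
    returns the names of family members the query 'detects'."""
    return {name for name, seq in family.items() if identity(query, seq) >= threshold}


def best_representative(
    family: Dict[str, str],
    search: Callable[[str, Dict[str, str]], Set[str]] = toy_hits,
) -> Tuple[str, float]:
    """Return (name, coverage) of the member whose search recovers
    the largest fraction of its own family."""
    best_name, best_cov = "", -1.0
    for name in sorted(family):  # sorted so that ties break deterministically
        coverage = len(search(family[name], family)) / len(family)
        if coverage > best_cov:
            best_name, best_cov = name, coverage
    return best_name, best_cov


# Invented toy family; real input would be seed or full alignment members.
family = {
    "seqA": "MKTAYIAKQR",
    "seqB": "MKTAYIAKQK",
    "seqC": "MKSAYLAKQR",
    "seqD": "MRSGYLGKNR",
}

name, coverage = best_representative(family)
print(name, coverage)
```

In a real setting the `search` argument would wrap an actual profile search, and the profile built from the selected sequence would be the one stored for the family.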
Description
10 page(s)
Resource Type
journal article
Organisation
Macquarie University. Dept. of Chemistry and Biomolecular Sciences

Identifier
http://hdl.handle.net/1959.14/89070
Identifier
ISSN:1756-0381
Identifier
mq-rm-2009007412
Language
eng
Rights
© 2009 Shameer et al; licensee BioMed Central Ltd. Version archived for private and non-commercial use with the permission of the author and according to publisher conditions. For further rights please contact the publisher.
Full Text
Full Text
Reviewed
Reviewed
 