Comparison of visualization methods of genome-wide SNP profiles in childhood acute lymphoblastic leukaemia

Al-Oqaily, A; Kennedy, PJ; Catchpoole, D; Simoff, S

Comparison of visualization methods of genome-wide SNP profiles in childhood acute lymphoblastic leukaemia

Al-Oqaily, A Kennedy, PJ

Catchpoole, D Simoff, S

Permalink

Publication Type:: Conference Proceeding
Citation:: Conferences in Research and Practice in Information Technology Series, 2008, 87 pp. 111 - 121
Issue Date:: 2008-12-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download full textAdobe PDF (943.39 kB)

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Al-Oqaily, A	en_US
dc.contributor.author	Kennedy, PJ https://orcid.org/0000-0001-7837-3171	en_US
dc.contributor.author	Catchpoole, D	en_US
dc.contributor.author	Simoff, S	en_US
dc.date.issued	2008-12-01	en_US
dc.identifier.citation	Conferences in Research and Practice in Information Technology Series, 2008, 87 pp. 111 - 121	en_US
dc.identifier.isbn	9781920682682	en_US
dc.identifier.issn	1445-1336	en_US
dc.identifier.uri	http://hdl.handle.net/10453/10755
dc.description.abstract	Data mining and knowledge discovery have been applied to datasets in various industries including biomedical data. Modelling, data mining and visualization in biomedical data address the problem of extracting knowledge from large and complex biomedical data. The current challenge of dealing with such data is to develop statistical-based and data mining methods that search and browse the underlying patterns within the data. In this paper, we employ several data reduction methods for visualizing genome- wide Single Nucleotide Polymorphism (SNP) datasets based on state-of-art data reduction techniques. Visualization approach has been selected based on the trustworthiness of the resultant visualizations. To deal with large amounts of genetic variation data, we have chosen to apply different data reduction methods to deal with the problem induced by high dimensionality. Based on the trustworthiness metric we found that neighbour Retrieval Visualizer (NeRV) outperformed other methods. This method optimizes the retrieval quality of Stochastic neighbour Embedding. The quality measure of the visualization (i.e. NeRV) showed excellent results, even though the dataset was reduced from 13917 to 2 dimensions. The visualization results will assist clinicians and biomedical researchers in understanding the systems biology of patients and how to compare different groups of clusters in visualizations. © 2008, Australian Computer Society, Inc.	en_US
dc.relation.ispartof	Conferences in Research and Practice in Information Technology Series	en_US
dc.title	Comparison of visualization methods of genome-wide SNP profiles in childhood acute lymphoblastic leukaemia	en_US
dc.type	Conference Proceeding
utslib.citation.volume	87	en_US
utslib.for	0803 Computer Software	en_US
dc.location.activity	Adelaide, Australia	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Software
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
pubs.organisational-group	/University of Technology Sydney/Strength - CHT - Health Technologies
utslib.copyright.status	open_access
pubs.publication-status	Published	en_US
pubs.volume	87	en_US

Abstract:

Data mining and knowledge discovery have been applied to datasets in various industries including biomedical data. Modelling, data mining and visualization in biomedical data address the problem of extracting knowledge from large and complex biomedical data. The current challenge of dealing with such data is to develop statistical-based and data mining methods that search and browse the underlying patterns within the data. In this paper, we employ several data reduction methods for visualizing genome- wide Single Nucleotide Polymorphism (SNP) datasets based on state-of-art data reduction techniques. Visualization approach has been selected based on the trustworthiness of the resultant visualizations. To deal with large amounts of genetic variation data, we have chosen to apply different data reduction methods to deal with the problem induced by high dimensionality. Based on the trustworthiness metric we found that neighbour Retrieval Visualizer (NeRV) outperformed other methods. This method optimizes the retrieval quality of Stochastic neighbour Embedding. The quality measure of the visualization (i.e. NeRV) showed excellent results, even though the dataset was reduced from 13917 to 2 dimensions. The visualization results will assist clinicians and biomedical researchers in understanding the systems biology of patients and how to compare different groups of clusters in visualizations. © 2008, Australian Computer Society, Inc.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/10755