Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
start [2013/02/15 14:35]
admin
start [2017/02/21 06:45] (current)
admin [Publications]
Line 2: Line 2:
  
 The domain of data mining (DM) deals with analyzing different types of data. The data typically used in data mining is  in the format of a single table, with primitive datatypes as attributes. However, structured (complex) data, such as graphs, sequences, networks, text, image, multimedia and relational data, are receiving an increasing amount of interest in data mining. A major challenge is to treat and represent the mining of different types of structured data in a uniform fashion. ​ The domain of data mining (DM) deals with analyzing different types of data. The data typically used in data mining is  in the format of a single table, with primitive datatypes as attributes. However, structured (complex) data, such as graphs, sequences, networks, text, image, multimedia and relational data, are receiving an increasing amount of interest in data mining. A major challenge is to treat and represent the mining of different types of structured data in a uniform fashion. ​
 +
 +{{ :​ontodmmodules.png?​nolink&​300|}}
  
 A theoretical framework that unifies different data mining tasks, on different types of data can help to formalize the knowledge about the domain and provide a base for future research, unification and standardization. ​ Next, automation and overall support of the Knowledge Discovery in Databases (KDD) process is also an important challenge in the domain of data mining. A formalization of the domain of data mining is a solution that addresses these challenges. It can directly support the development of a general framework for data mining, support the representation of the process of mining structured data, and allow the representation of the complete process of knowledge discovery.  A theoretical framework that unifies different data mining tasks, on different types of data can help to formalize the knowledge about the domain and provide a base for future research, unification and standardization. ​ Next, automation and overall support of the Knowledge Discovery in Databases (KDD) process is also an important challenge in the domain of data mining. A formalization of the domain of data mining is a solution that addresses these challenges. It can directly support the development of a general framework for data mining, support the representation of the process of mining structured data, and allow the representation of the complete process of knowledge discovery.
-{{ :​ontodm-structure.png?​nolink&​200|}} 
  
 We propose a reference modular ontology for the domain of data mining OntoDM, directly motivated by the need for formalization of the data mining domain. The OntoDM ontology is designed and implemented by following ontology best practices and design principles. Its distinguishing feature is that it uses [[http://​www.ifomis.org/​bfo|Basic Formal Ontology (BFO)]] as an upper-level ontology and a template, a set of formally defined relations from Relational Ontology (RO) and other state-of-the-art ontologies, and reuses classes and relations from the [[http://​obi-ontology.org/​page/​Main_Page|Ontology of Biomedical Investigations (OBI)]], the [[http://​code.google.com/​p/​information-artifact-ontology/​|Information Artifact Ontology (IAO)]], and the [[http://​theswo.sourceforge.net/​|Software Ontology (SWO)]]. This will ensure compatibility and connections with other ontologies and allow cross-domain reasoning capabilities. ​ We propose a reference modular ontology for the domain of data mining OntoDM, directly motivated by the need for formalization of the data mining domain. The OntoDM ontology is designed and implemented by following ontology best practices and design principles. Its distinguishing feature is that it uses [[http://​www.ifomis.org/​bfo|Basic Formal Ontology (BFO)]] as an upper-level ontology and a template, a set of formally defined relations from Relational Ontology (RO) and other state-of-the-art ontologies, and reuses classes and relations from the [[http://​obi-ontology.org/​page/​Main_Page|Ontology of Biomedical Investigations (OBI)]], the [[http://​code.google.com/​p/​information-artifact-ontology/​|Information Artifact Ontology (IAO)]], and the [[http://​theswo.sourceforge.net/​|Software Ontology (SWO)]]. This will ensure compatibility and connections with other ontologies and allow cross-domain reasoning capabilities. ​
 ====== OntoDM Structure ====== ====== OntoDM Structure ======
 The OntoDM ontology is composed of three sub-ontologies covering different aspects of data mining: ​ The OntoDM ontology is composed of three sub-ontologies covering different aspects of data mining: ​
-  * **[[OntoDT]]**,​ that supports the representation of knowledge about datatypes and is based on an accepted [[http://​standards.iso.org/​ittf/​PubliclyAvailableStandards/​c039479_ISO_IEC_11404_2007(E).zip|ISO standard for datatypes in computer systems]];  +  * **[[OntoDT|Ontology of Datatypes (OntoDT)]]**, that supports the representation of knowledge about datatypes and is based on an accepted [[http://​standards.iso.org/​ittf/​PubliclyAvailableStandards/​c039479_ISO_IEC_11404_2007(E).zip|ISO standard for datatypes in computer systems]];  
-  * **[[OntoDM-core]]**,​ that formalizes the key data mining entities for representing the mining of structured data in the context of a [[http://​link.springer.com/​chapter/​10.1007%2F978-3-540-75549-4_16?​LI=true|general framework for data mining]]; and  +  * **[[OntoDM-core|Ontology of Core Data Mining Entities (OntoDM-core)]]**, that formalizes the key data mining entities for representing the mining of structured data in the context of a [[http://​link.springer.com/​chapter/​10.1007%2F978-3-540-75549-4_16?​LI=true|general framework for data mining]]; and  
-  * **[[OntoDM-KDD]]**,​ that formalizes the knowledge discovery process based on the [[http://​www.google.si/​url?​sa=t&​rct=j&​q=crisp-dm&​source=web&​cd=7&​cad=rja&​ved=0CEcQFjAG&​url=http%3A%2F%2Fwww.the-modeling-agency.com%2Fcrisp-dm.pdf&​ei=Za4CUYrwIaji4QSX9YHgCA&​usg=AFQjCNFTNXE36E5pSvSvKi_8QhAv2w0ayA|Cross Industry Standard Process for Data Mining (CRISP-DM)]] process model. ​ +  * **[[OntoDM-KDD|Ontology of Data Mining Investigations (OntoDM-KDD)]]**, that formalizes the knowledge discovery process based on the [[http://​www.google.si/​url?​sa=t&​rct=j&​q=crisp-dm&​source=web&​cd=7&​cad=rja&​ved=0CEcQFjAG&​url=http%3A%2F%2Fwww.the-modeling-agency.com%2Fcrisp-dm.pdf&​ei=Za4CUYrwIaji4QSX9YHgCA&​usg=AFQjCNFTNXE36E5pSvSvKi_8QhAv2w0ayA|Cross Industry Standard Process for Data Mining (CRISP-DM)]] process model.  
- +====== Publications ====== 
-====== ​Versions & Publications ​related to the versions ​====== +  * **[[version_1.0r|Release ​1.0R]]** 
-  * Release +    * **OntoDT:** Panče Panov, Larisa Soldatova, Sašo Džeroski.** [[http://www.sciencedirect.com/​science/​article/​pii/​S0020025515005800|Generic Ontology of Datatypes]]**. Information Sciences 329 (2016) 900–920 ​ 
-    * **[[Version 1.0R]]** +    * **OntoDM-core:​** Panče Panov, Larisa Soldatova, Sašo Džeroski.** [[https://​www.dropbox.com/​s/​pnu2tud5z8766id/​OntoDM-core-Springer.pdf?​dl=0|Ontology of core data mining entities]]**. Data Mining and Knowledge Discovery 28(5-6):​1222-1265,​ 2014 DOI [[http://​link.springer.com/​article/​10.1007/​s10618-014-0363-0#​|10.1007/​s10618-014-0363-0]]  
-      * Panče Panov. [[https://​www.dropbox.com/​s/​0w1gwjja76sipgi/​PanovPhD2012.pdf|A Modular Ontology of Data Mining]]. Doctoral Thesis. Jožef Stefan International Postgraduate School. 2012  +    * **OntoDM-KDD:​** Panče Panov, Larisa Soldatova, Sašo Džeroski. **[[http://​link.springer.com/​chapter/​10.1007/​978-3-642-40897-7_9|OntoDM-KDD:​ Ontology for Representing the Knowledge Discovery Process]]**. Discovery Science 2013, Lecture Notes in Computer Science Volume 8140, pp 126-140, 2013 
-  ​* Preliminary release +    * **Doctoral Thesis: ** Panče Panov. **[[https://​www.dropbox.com/​s/​0w1gwjja76sipgi/​PanovPhD2012.pdf?dl=0|A Modular Ontology of Data Mining]]**. Doctoral Thesis. Jožef Stefan International Postgraduate School. 2012 
-    ​* **[[Version 0.2P]]**+  * **[[Version 0.2P]]**
       * Panče Panov, Larisa Soldatova and Sašo Džeroski. **[[http://​www.springerlink.com/​content/​q687176373767mnu/​|Representing Entities in the OntoDM Data Mining Ontology]]** . In Sašo Džeroski, Bart Goethals and Panče Panov (Eds.) Inductive Databases and Constraint-Based Data Mining., pg.27-55., 2010, Springer ​ {{:​ontodm-final-book.pdf|Download}}       * Panče Panov, Larisa Soldatova and Sašo Džeroski. **[[http://​www.springerlink.com/​content/​q687176373767mnu/​|Representing Entities in the OntoDM Data Mining Ontology]]** . In Sašo Džeroski, Bart Goethals and Panče Panov (Eds.) Inductive Databases and Constraint-Based Data Mining., pg.27-55., 2010, Springer ​ {{:​ontodm-final-book.pdf|Download}}
       * Panče Panov, Larisa Soldatova, Sašo Džeroski. **[[http://​www.springerlink.com/​content/​u55g42627185r7t7/​|Towards an Ontology of Data Mining Investigations]]**. Proceedings of the Twelfth International Conference on Discovery Science (DS09), Lecture Notes in Computer Science (LNCS) Vol. 5808, Porto, Portugal 3-5 October 2009, Springer {{:​ds_09_panovetal.pdf|Download}}       * Panče Panov, Larisa Soldatova, Sašo Džeroski. **[[http://​www.springerlink.com/​content/​u55g42627185r7t7/​|Towards an Ontology of Data Mining Investigations]]**. Proceedings of the Twelfth International Conference on Discovery Science (DS09), Lecture Notes in Computer Science (LNCS) Vol. 5808, Porto, Portugal 3-5 October 2009, Springer {{:​ds_09_panovetal.pdf|Download}}
-    ​* **[[Version 0.1P]]**+  ​* **[[Version 0.1P]]**
         * Panče Panov, Sašo Džeroski, Larisa N. Soldatova. **[[http://​www2.computer.org/​portal/​web/​csdl/​doi/​10.1109/​ICDMW.2008.62|OntoDM:​ an ontology of data mining]]**. In: Bonchi, Francesco (ed.). ICDM workshops 2008 : proceedings. Los Alamitos (California);​ Washington; Tokyo: IEEE Computer Society Conference Publishing Services, cop. 2008, pg. 752-760. {{:​ontodm_panovetal_icdmw08.pdf|Download}}         * Panče Panov, Sašo Džeroski, Larisa N. Soldatova. **[[http://​www2.computer.org/​portal/​web/​csdl/​doi/​10.1109/​ICDMW.2008.62|OntoDM:​ an ontology of data mining]]**. In: Bonchi, Francesco (ed.). ICDM workshops 2008 : proceedings. Los Alamitos (California);​ Washington; Tokyo: IEEE Computer Society Conference Publishing Services, cop. 2008, pg. 752-760. {{:​ontodm_panovetal_icdmw08.pdf|Download}}
-====== ​Contributors & Developers ​====== +====== ​Main Developer ​====== 
-  ​* ​[[http://​www-ai.ijs.si/​SasoDzeroski/​|Sašo Džeroski]]Jožef Stefan Institute, Department of Knowledge Technologies,​ Ljubljana, Slovenia[[Saso.Dzeroski@ijs.si]] +[[http://​www.pancepanov.com|Panče Panov]] Jožef Stefan Institute, Department of Knowledge Technologies,​ Ljubljana, Slovenia [[Pance.Panov@ijs.si]] ​ 
-  * [[http://​www.brunel.ac.uk/siscm/disc/people-in-disc/academic-staff/​dr-larisa-soldatova|Larisa Soldatova]]Department of Information Systems and Computing, Brunel University, Uxbridge, UK[[larisa.soldatova@brunel.ac.uk]] +====== Contributors ====== 
-  * [[http://kt.ijs.si/panovp|Panče Panov]]Jožef Stefan Institute, Department of Knowledge Technologies,​ Ljubljana, Slovenia[[Pance.Panov@ijs.si]] +  * [[http://​www.brunel.ac.uk/​people/​larisa-soldatova|Larisa Soldatova]] Department of Computer Science, Brunel University, Uxbridge, UK [[larisa.soldatova@brunel.ac.uk]] 
- +  * [[http://www-ai.ijs.si/SasoDzeroski/​|Sašo Džeroski]] Jožef Stefan Institute, Department of Knowledge Technologies,​ Ljubljana, Slovenia [[Saso.Dzeroski@ijs.si]]
-====== Contact ====== +
-[[admin@ontodm.com]]+

QR Code
QR Code The OntoDM ontology (generated for current page)