Project Details
Description
Data mining is the driving force behind a paradigm shift in the way we conceive models of our surroundings. Society relies extensively on models of physical, social, and economic phenomena, that give the predictive power allowing us to build a bridge, confident that it will not collapse, forecast the effect of economic stimulus packages, select the best livestock for breeding, etc. The massive amount of data that has become available opens up for completely new ways of understanding and modeling data, through the use of algorithms.
In this project we will explore the possibilities in a new approach to massive data mining with origins in advanced sampling methods of data stream algorithmics. This happy marriage of efficient algorithms and statistical principles is the unifying idea of several recent highly successful research contributions of the applicant in the field of data mining. In collaboration with external partners, the project will focus on three application areas with a common theoretical core: Financial modeling, recommendation systems, and genotype/phenotype mining.
In this project we will explore the possibilities in a new approach to massive data mining with origins in advanced sampling methods of data stream algorithmics. This happy marriage of efficient algorithms and statistical principles is the unifying idea of several recent highly successful research contributions of the applicant in the field of data mining. In collaboration with external partners, the project will focus on three application areas with a common theoretical core: Financial modeling, recommendation systems, and genotype/phenotype mining.
| Acronym | MaDaMS |
|---|---|
| Status | Finished |
| Effective start/end date | 01/01/2011 → 31/12/2014 |
Funding
- Independent Research Fund Denmark: DKK4,769,140.00
Keywords
- Algoritmik
- Data mining
- Data analyse
Fingerprint
Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.
-
Consistent subset sampling
Kutzkov, K. & Pagh, R., 2014, In: Lecture Notes in Computer Science. 8503, p. 294-305Research output: Journal Article or Conference Article in Journal › Conference article › Research › peer-review
-
Efficient estimation for high similarities using odd sketches
Mitzenmacher, M., Pagh, R. & Pham, N. D., 2014, Proceedings of the 23rd international conference on World wide web: WWW '14. Association for Computing Machinery, p. 109-118 10 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
-
Is Min-Wise Hashing Optimal for Summarizing Set Intersection?
Pagh, R., Stöckel, M. & Woodruff, D., 2014, Proceedings of the 2014 ACM SIGMOD international conference on Management of data. Association for Computing Machinery, p. 109-120Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
Press/Media
-
University student invents algorithm speeds up internet searches
Pham, N. D.
27/11/2014
1 item of Media coverage
Press/Media: Press / Media
-
Opfindelse fra Danmark gør computere hurtigere
Pham, N. D.
26/11/2014
1 item of Media coverage
Press/Media: Press / Media
Activities
- 1 Other (prizes, external teaching and other activities) - Prizes, scholarships, distinctions
-
Best Paper Award
Pham, N. D. (Participant)
7 Apr 2014 → 11 Apr 2014Activity: Other activity types › Other (prizes, external teaching and other activities) - Prizes, scholarships, distinctions