
Massive Data Mining by Sampling

  • Pagh, Rasmus (PI)
  • Stöckel, Morten (CoI)
  • Pham, Ninh Dang (CoI)

Project: Research


2014

  • Consistent subset sampling

    Kutzkov, K. & Pagh, R., 2014, In: Lecture Notes in Computer Science. 8503, p. 294-305

    Research output: Journal Article or Conference Article in Journal > Conference article > Research > peer-review

  • Efficient estimation for high similarities using odd sketches

    Mitzenmacher, M., Pagh, R. & Pham, N. D., 2014, Proceedings of the 23rd international conference on World wide web: WWW '14. Association for Computing Machinery, p. 109-118 10 p.

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review

  • Is Min-Wise Hashing Optimal for Summarizing Set Intersection?

    Pagh, R., Stöckel, M. & Woodruff, D., 2014, Proceedings of the 2014 ACM SIGMOD international conference on Management of data. Association for Computing Machinery, p. 109-120

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review

  • Listing Triangles

    Björklund, A., Pagh, R., Vassilevska Williams, V. & Zwick, U., 2014, Automata, Languages, and Programming, 41st International Colloquium, ICALP 2014. Springer, p. 223-234

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review

  • MapReduce Triangle Enumeration With Guarantees

    Park, H.-M., Silvestri, F., Kang, U. & Pagh, R., 2014, CIKM '14 Proceedings of the 23rd ACM International Conference on Information and Knowledge Management. Association for Computing Machinery, p. 1739-1748

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review

  • On the Power of Randomization in Big Data Analytics

    Pham, N. D., 7 Oct 2014, IT University of Copenhagen: IT-Universitetet i København. 117 p.

    Research output: Theses > PhD thesis

    Open Access
  • Triangle counting in dynamic graph streams

    Kutzkov, K. & Pagh, R., 2014, Algorithm Theory – SWAT 2014. Springer

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review

2013

  • Deterministic algorithms for skewed matrix products

    Kutzkov, K., 2013, In: Dagstuhl Seminar Proceedings. 12 p.

    Research output: Journal Article or Conference Article in Journal > Journal article > Research > peer-review

  • On the streaming complexity of computing local clustering coefficients

    Kutzkov, K. & Pagh, R., 2013, WSDM '13 Proceedings of the sixth ACM international conference on Web search and data mining. Association for Computing Machinery, p. 677-686 9 p.

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review

  • STRIP: stream learning of influence probabilities

    Kutzkov, K., 2013, KDD '13 Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. Association for Computing Machinery, p. 275-283 9 p.

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review

2012

  • A Near-linear Time Approximation Algorithm for Angle-based Outlier Detection in High-dimensional Data

    Pham, N. D. & Pagh, R., 12 Aug 2012, KDD '12 Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. Association for Computing Machinery, p. 877-885 9 p.

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review

  • Compressed Matrix Multiplication

    Pagh, R., 2012, ITCS 12. Proceedings of the Innovations in Theoretical Computer Science Conference, 3. Association for Computing Machinery, p. 442-451

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review

  • Improved counter based algorithms for frequent pairs mining in transactional data streams

    Kutzkov, K., 2012, ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases. Springer, Vol. Part 1. 16 p.

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review

2011

  • Frequent Pairs in Data Streams: Exploiting Parallelism and Skew

    Campagna, A., Kutzkov, K. & Pagh, R., 2011, Proceedings of IEEE International Conference on Data Mining Workshops: ICDMW 2011. IEEE, p. 145-150

    Research output: Conference Article in Proceeding or Book/Report chapter > Article in proceedings > Research > peer-review