2015
  • Aayushee Gupta and Haimonti Dutta. Evaluation of Spell Correction on Noisy OCR Data. INFORMS Workshop on Data Mining and Analytics at INFORMS Annual Meeting, Philadelphia, October 2015.
  • Aayushee Gupta and Haimonti Dutta. A Machine Learning Framework for Quantitative Prosopography, Grace Hopper Celebration in India, December 2015.
  • Megha Gupta, Haimonti Dutta, Brian Geiger. Classification of Crowdsourced Text Correction. iKDD CODS, March 2015
2014
  • Nipun Batra, Amarjeet Singh, Pushpendra Singh, Haimonti Dutta, Venkatesh Sarangan and Mani Srivastava, "Data Driven Energy Efficiency in Buildings", April 2014, arXiv:1404.7227
  • Nipun Batra, Jack Kelly, Oliver Parson, Haimonti Dutta, William Knottenbelt, Alex Rogers, Amarjeet Singh and Mani Srivastava, "NILMTK: An Open Source Toolkit for Non-Intrusive Load Monitoring", In Proceedings of the 5th International Conference on Future Energy Systems (ACM e-Energy), Cambridge, UK, Jun. 2014.
2013
  • Nipun Batra, Haimonti Dutta and Amarjeet Singh, "INDiC: Improved Non-Intrusive Load Monitoring using Load Division and Calibration.", 12th International Conference on Machine Learning and Applications, Miami, FL, Dec. 2013.
2012
  • Haimonti Dutta and William Chan. "Using community structure detection to rank annotators when ground truth is subjective", NIPS Workshop on Human Computation for Science and Computational Sustainability, Lake Tahoe, December 7th, 2012.
  • Xianshu Zhu, Tushar Mahule, Haimonti Dutta, Sugandha Arora, Hillol Kargupta, Kirk D. Borne."Peer-to-peer distributed text classifier learning in PADMINI." Statistical Analysis and Data Mining 5 (5): 446-462, 2012.
  • Rebecca J. Passonneau, Ashish Tomar, Somnath Sarkar, Haimonti Dutta and Axinia Radeva, "Multivariate Assessment of a Repair Program for a New York City Electrical Grid", 11th International Conference on Machine Learning and Applications ICMLA, Special Session on Machine Learning in Energy Applications, Boca Raton, FL, Dec 13 - 15, 2012.
  • Boyi Xie, Rebecca J. Passonneau, Haimonti Dutta, Jing-Yeu Miaw, Axinia Radeva, Ashish Tomar and Cynthia Rudin. "Progressive Clustering with Learned Seeds: An Event Categorization System for Power Grid." 24th International Conference on Software Engineering and Knowledge Engineering (SEKE 2012). Redwood City, CA. July 1-3, 2012.
2011
  • Haimonti Dutta,"A Randomized Gossip-based Algorithm for Classification on Peer-to-Peer Net- works", In Proceedings of the NIPS Workshop on Big Learning: Algorithms, Systems, and Tools for Learning at Scale, Grenada, Spain, Dec 2011.
  • Haimonti Dutta, Huascar Fiorletta, Manoj Pooleery, Hatim Diab, Stanley German, David Waltz, “A Case-Study on Learning from Large-scale Intracranial EEG Data using Multi-core Machines and Clusters", In Proceedings of The Third Workshop on Large-scale Data Mining: Theory and Applications, SIGKDD, San Diego, August, 2011.
  • Shen Wang and Haimonti Dutta, ``PARABLE: A PArallel RAndom-partition Based HierarchicaL ClustEring Algorithm for the MapReduce Framework", Technical Report, CCLS-11-04, 2011.
  • Shen Wang and Haimonti Dutta, “PARABLE: A PArallel RAndom-partition Based HierarchicaL ClustEring Algorithm for the MapReduce Framework", 6th Annual Machine Learning Symposium at the New York Academy of Science (NYAS), 2011.
  • Haimonti Dutta, “Density Estimation Based Ranking from Decision Trees", 6th Annual Machine Learning Symposium at the New York Academy of Science (NYAS), 2011.
  • Leon Wu, Gail Kaiser, Cynthia Rudin, David Waltz, Roger Anderson, Albert Boulanger, Ansaf Salleb-Aouissi, Haimonti Dutta, and Manoj Poolery, "Evaluating Machine Learning for Improving Power Grid Reliability", Proceedings of the Workshop on Machine Learning for Global Challenges, International Conference on Machine Learning (ICML), 2011.
  • Cynthia Rudin, David Waltz, Roger N. Anderson, Albert Boulanger, Ansaf Salleb-Aouissi, Maggie Chow, Haimonti Dutta, Philip Gross, Bert Huang, Steve Ierome, DelÞna Isaac, Arthur Kressner, Rebecca J. Passonneau, Axinia Radeva and Leon Wu, “Machine Learning for the New York City Power Grid", IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2011.
  • Haimonti Dutta, Alex Kamil, Manoj Pooleery, Simha Sethumadhavan and John Demme, "Distributed Storage of Large Scale Multidimensional Electroencephalogram Data using Hadoop and HBase", Book Chapter in Grid and Cloud Database Management, Editors Sandro Fiore and Giovanni Aloisio, Springer, 2011.
  • Haimonti Dutta, Rebecca J. Passonneau, Austin Lee, Axinia Radeva, Boyi Xie, David Waltz and Barbara Taranto, "Learning Parameters of the K-Means Algorithm from Subjective Human Annotation.", The 24th International FLAIRS Conference, Special Track on Data Mining, Palm Beach, FL. May 18-20, 2011.

2010
  • Austin Lee, Haimonti Dutta, Rebecca Passonneau, David Waltz and Barbara Taranto, "Topic Identification from Historic Newspaper Articles of the New York Public Library: A Case Study", 5th Annual Machine Learning Symposium, NYAS, 2010.
  • Margret Una Kjartansdottir, Haimonti Dutta, Catherine A Schevon, Ansaf Salleb-Aouissi, David Waltz and Ronald Emerson, "Detection of High Frequency Oscillations Using Support Vector Machines: A Case Study", WiML, 2010 (Held in Conjunction with NIPS 2010), Vancouver, BC
  • Haimonti Dutta, David Waltz, Karthik M Ramasamy, Phil Gross, Ansaf Salleb-Aouissi, Hatim Diab, Manoj Pooleery, Catherine A Schevon and Ronald Emerson, "Patient-Specific Seizure Detection Fro Intra-cranial EEG Using High Dimensional Clustering, ICMLA, Bethesda, MD, 2010.
  • Haimonti Dutta, David Waltz, Ansaf Salleb-Aouissi, Catherine Schevon and Ronald Emerson, "Designing Patient-Specific Seizure Detectors From Multiple Frequency Bands of Intra-cranial EEG Using Support Vector Machines", Workshop on Data Mining for HealthCare Management held in conjunction with PAKDD, 2010, Hyderabad, India.
  • Cynthia Rudin,Rebecca J. Passonneau, Axinia Radeva, Haimonti Dutta, Steve Ierome, Delfina Isaac, "A process for predicting manhole events in Manhattan.", Machine Learning, 2010. Available here

2009
  • Phil Gross, Ansaf Salleb-Aouissi, Haimonti Dutta and Albert Boulanger, "Ranking Electrical Feeders on the New York Power Grid", ICMLA, 2009.
  • Chase Hensel and Haimonti Dutta, "GERMS: a distributed sub-Gradient ERM Solver", 4th Annual Machine Learning Symposium at the New York Academy of Sciences (NYAS), New York, November, 2009.
  • Haimonti Dutta, David Waltz, Alessandro Moschitti, Daniele Pighin, Philip Gross, Claire Monteleoni, Ansaf Salleb-Aouissi, Albert Boulanger, Manoj Pooleery and Roger Anderson, "Estimating the Time Between Failures of Electrical Feeders in the New York Power Grid", Next Generation Data Mining Summit, NGDM 2009, Columbia MD.
  • Haimonti Dutta, Xianshu Zhu, Tushar Mahule, Hillol Kargupta, Kirk Borne, Codrina Lauth, Florian Holz, and Gerherd Heyer, "TagLearner: A P2P Classifier Learning System from Collaboratively Tagged Text Documents",International Conference on Data Mining (ICDM), Workshop on Mining Multiple Information Sources, December, 2009.
  • Haimonti Dutta, "Measuring Diversity in Regression Ensembles", 4th Indian International Conference on Artificial Intelligence (IICAI), Bangalore, India, December 2009.
  • Chase Hensel and Haimonti Dutta, "GADGET SVM: a Gossip-bAseD sub-GradiEnT SVM Solver", International Conference on Machine Learning (ICML), Numerical Mathematics in Machine Learning Workshop, Montreal, Quebec, 2009.
  • Haimonti Dutta, David Waltz, Catherine A. Schevon, Karthik M Ramasamy, Phil Gross, Ansaf Salleb-Aouissi, Hatim Diab, Manoj Pooleery, Albert Boulanger and Ron Emerson, "Seizure Detection from Multiple Frequency Bands of Intra-cranial EEG using High Dimensional Clustering", 4th International Workshop on Seizure Prediction, Kansas City, MO, June 4th - 7th, 2009.

2008
  • Haimonti Dutta and Hillol Kargupta, "Distributed Linear Programming and Resource Management for Data Mining in Distributed Environments", 10th International Workshop on High Performance Data Mining (HPDM) held in conjunction with the International Conference on Data Mining (ICDM), Pisa Italy.
  • Phil Gross, Ansaf Salleb-Aouissi, Haimonti Dutta and Albert Boulanger, "Ranking Electrical Feeders of the New York Power Grid", 3rd Annual Machine Learning Symposium at the New York Academy of Sciences (NYAS), New York, October, 2008
  • Haoyun Feng, Haimonti Dutta and Ansaf Salleb-Aouissi, "On Improving Probability Estimate Trees", 3rd Annual Workshop for Women in Machine Learning (WiML) held in conjunction with Neural Information Processing Systems (NIPS), Vancouver, B.C., 2008.
  • Haimonti Dutta and Ananda Mathur, "Distributed Optimization Strategies for Mining on Peer-to-Peer Networks", Accepted for publication in International Conference on Machine Learning and Applications (ICMLA), 2008.
  • Chris Giannella, Haimonti Dutta, Kirk Borne, Ran Wolff, Hillol Kargupta, "Distributed Data Mining in Astronomy Catalogs", Technical Report.
  • Haimonti Dutta, Cynthia Rudin, Becky Passonneau, Fred Seibel, Nandini Bhardwaj, Axinia Radeva, Zhi An Liu, Steve Ierome and Delfina Isaac, "Visualization of Manhole and Precursor-Type Events for the Manhattan Electrical Distribution System", Workshop on Geo-Visualization of Dynamics, Movement and Change, 11th AGILE International Conference on Geographic Information Science, Girona, Spain, 2008.

2007
  • "Empowering Scientific Discovery by Distributed Data Mining on the Grid Infrastructure", Ph.D. Thesis, University of Maryland, Baltimore County, 2007.
  • Haimonti Dutta, Chris Giannella, Kirk Borne and Hillol Kargupta,  " Distributed Top-K Outlier Detection in Astronomy Catalogs using the DEMAC system", Accepted for publication at SIAM International Conference on Data Mining, 2007, Minneapolis, USA.

2006
  • Haimonti Dutta, " Empowering Scientific Discovery by Distributed Data Mining on the Grid Infrastructure", Ph.D. Proposal
  • Hillol Kargupta, Byung Hoon Park, Haimonti Dutta. (2006). Orthogonal Decision Trees, IEEE Transactions on Knowledge and Data Engineering, Vol 18, No 7, July 2006. 
  • Haimonti Dutta, "Empowering Scientific Discovery by Distributed Data Mining on the Grid Infrastructure", Proceedings of the IBM Ph.D. Symposium at the International Conference on Service Oriented Computing (ICSOC), 2006. 
  • Chris Giannella, Haimonti Dutta, Sourav Mukherjee and Hillol Kargupta,“Distributed Kernel Density Estimation", 9th International Workshop on High Performance and Distributed Mining, 2006 (HPDM 2006)
  • Chris Giannella, Haimonti Dutta, Ran Wolff, Kirk Borne and Hillol Kargupta, “Distributed Data Mining in Astronomy Databases”, The 9th Workshop on Mining Scientific and Engineering Data Sets(to be held in conjunction with SDM 2006)

2005

  • Haimonti Dutta, Hillol Kargupta, and Anupam Joshi, “Orthogonal Decision Trees for Resource-Constrained Physiological Data Stream Monitoring using Mobile Devices”,High Performance Computing Conference(HiPC 2005), India.

2004
  • Hillol Kargupta and Haimonti Dutta (2004). Orthogonal Decision Trees. The Fourth IEEE International Conference on Data Mining. Brighton, UK, pages 487—490.

2003
  • Haimonti Dutta, Hillol Kargupta, Souptik Datta and Krishanamoorthy Siva Kumar, “Privacy Preserving Data Mining and Random Perturbations”, Workshop on Privacy in the Electronic Society 2003, (in association with 10th ACM Conference on Computer and Communications Security) 
  • Madhu Nayakkankuppam and Haimonti Dutta, "Maximum Likelihood Phylogenetic Tree Construction", Extended Abstract submitted to Graduate Research Conference, University of Maryland, Baltimore County, 2003.

2002
  • Vasilis Megalooikonomou, Haimonti Dutta, Despina Kontos, “Fast and Effective Characterization of 3D Region Data”, International Conference of Image Processing (ICIP) 2002 Rochester, NY.  

Unpublished Manuscripts
  • Despina Kontos, Vasileios Megalooikonomou, Marc Sobel and Haimonti Dutta, "Effective Feature Selection for Characterization and Classification of Spatial Region Data"