Home About Research Seminars People Publications
banner

Featured Papers


A GraphBLAS Approach for Subgraph Counting

HarpLDA+: Optimizing Latent Dirichlet Allocation for Parallel Efficiency

Parallel Clustering of High-Dimensional Social Media Data Streams

Harp: Collective Communication on Hadoop
2018
Under review
2017
Presented at IEEE Big Data 2017
2015
Presented at IEEE CCGrid 2015
2015
Presented at IEEE IC2E 2015

Book Contributions

Langshi Chen, Bo Peng, Sabra Ossen, Anil Vullikanti, Madhav Marathe, Lei Jiang and Judy Qiu High-Performance Massive Subgraph Counting using Pipelined Adaptive-Group Communication. Book Series of “HPC and Big Data: Convergence and Ecosystem”,pp173-197, IOS Press, 2018. Doi: 10.3233/978-1-61499-882-2-173.


Zhang, B. Peng, J. Qiu. Parallelizing Big Data Machine Learning Parallelizing Big Data Machine Learning, book series on Advances in Parallel Computing published by IOS Press, 2016.


Tak-Lon (Stephen) Wu, Bingjing Zhang, ClaytonDavis, Emilio Ferrara, Sandro Flammini, Filippo Menczer, Judy Qiu “Scalable Query and Analysis for Social Networks: An Integrated High-Level Dataflow System with Pig and Harp”, Book chapter to be published in Big Data in Complex and Social Networks handbook, CRC Press, Taylor & Francis Group, 2016.


Journal Papers

Zhao Zhao, Meng Li, Mihai Avram, Guanying Wang, Ali Butt, Maleq Khan, Madhav Marathe, Judy Qiu, Anil Vullikanti Finding and counting tree-like subgraphs using MapReduce, Journal of IEEE Transactions on Multi-Scale Computing Systems, Volume: 4 Issue: 3, July-September 1, 2018.


Clayton A. Davis, Giovanni Luca Ciampaglia1, Luca Maria Aiello, Keychul Chung, Michael Conover, Emilio Ferrara, Alessandro Flammini, Geoffrey Fox, Xiaoming Gao, Bruno Gonçalves, Przemyslaw Grabowicz, Alex Hong, Pik-Mai Hui, Scott McCaulay, Karissa McKelvey, Mark Meiss, Snehal Patil, Chathuri Peli Kankanamalage, Valentin Pentchev, Judy Qiu, Jacob Ratkiewicz, Alex Rudnick, Benjamin Serrette, Prashant Shiralkar, Onur Varol, Lilian Weng, Tak-Lon Wu, Andrew Younge, and Filippo Menczer. OSoME: The IUNI Observatory on Social Media, PeerJ Computer Science 2:e87, October 3, 2016. Zhao Zhao, Meng Li, Mihai Avram, Guanying Wang, Ali Butt, Maleq Khan, Madhav Marathe, Judy Qiu, Anil Vullikanti


Refereed Conference and Workshop Proceedings

Lei Jiang, Langshi Chen, Judy Qiu Performance Characterization of Multi-threaded Graph Processing Applications on Many-Integrated-Core Architecture, IEEE International Symposium on Performance Analysis of Systems and Software, (ISPASS) held in Belfast, Northern Ireland, UK, April 2-4, 2018.


Bo Peng, Bingjing Zhang, Langshi Chen, Mihai Avram, Robert Henschel, Craig Stewart, Shaojuan Zhu, Emily Mccallum, Lisa Smith, Tom Zahniser, Jon Omer, and Judy Qiu, HarpLDA+: Optimizing Latent Dirichlet Allocation for Parallel Efficiency, Proceedings of the IEEE Big Data Conference 2017 held on December 11-14, 2017. Bigdata_Harp_LDA.pdf


Judy Qiu, Supun Kamburugamuve, Hyungro Lee, Jerome Mitchell, Rebecca Caldwell, Gina Bullock and Linda Hayden. "Teaching, Learning and Collaborating through Cloud Computing Online Classes", in the proceedings of the Workshop on Education for High-Performance Computing (EduHPC-17), Denver, Colorado. November 13, 2017.


Langshi Chen, Bo Peng, Bingjing Zhang, Tony Liu, Lei Jiang, Robert Henschel, Craig Stewart, Zhang Zhang, Emily Mccallum, Tom Zahniswer, Jon Omer, Judy Qiu, Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters, Proceedings of IEEE International Conference on Cloud Computing (IEEE Cloud 2017), held in Honolulu, Hawaii, June 25-30, 2017. Harp-DAAL.pdf


Bingjing Zhang, Bo Peng and Judy Qiu, Model-Centric Computation Abstractions in Machine Learning Applications, submitted to the 3rd Workshop on Algorithms and Systems for MapReduce and Beyond (BeyondMR2016), held in conjunction with SIGMOD Conference, July 1, 2016. Computation Abstractions.pdf


Bingjing Zhang, Bo Peng, Judy Qiu, High Performance LDA through Collective Model Communication Optimization, Proceedings of International Conference on Computational Science (ICCS2016) Conference, June 6-8, 2016, San Diego, California. Harp-lda.pdf


Xiaoming Gao, Emilio Ferrara, Judy Qiu, Parallel Clustering of High-Dimensional Social Media Data Streams, Presented at CCGrid2015 the 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing Conference (CCGrid 2015: acceptance rate 25.7%), Shenzhen, China, May 4-7, 2015.


Bingjing Zhang, Yang Ruan, Judy Qiu. Harp: Collective Communication on Hadoop Short paper in the Proceedings of IEEE International Conference on Cloud Engineering (IC2E), held in Tempe, Arizona, March 9-12, 2015.


Other Publications

Judy Qiu, Harp-DAAL for High Performance Big Data Computing, Intel Parallel Universe Magazine, March 17, 2018.


Tutorial on Harp-DAAL: A High Performance Machine Learning Framework for HPC-Cloud at Intel® HPC Developer Conference (HPCDC), held in Sheraton Denver Downtown Hotel, Denver, Colorado, November 11-12, 2017.


Bingjing Zhang, Bo Peng, Judy Qiu, Parallelizing Big Data Machine Learning Applications with Model Rotation, book chapter in New Frontiers in High Performance Computing, ISO Press,2017. Model Rotation.pdf


Kai ZheN, Mridul Birla, David Crandall, Bingjing Zhang, Judy Qiu, 3/15/2017 A Hybrid Supervised-unsupervised Method on Image Topic Visualization with Convolutional Neural Network and LDA, Indiana University


E. Gámiz, A. Bazavovb, C. Bernardc, C. DeTard, D. Due, A.X. El-Khadraf, E.D. Freelandg, Steven Gottliebh, U.M. Helleri, J. Komijanij, A.S. Kronfeldj,k, J. Laihoe, P.B. Mackenziek, E.T.Neill, T. Primerm, J.N. Simonek, R. Sugarn, D. Toussaintm, R.S. Van de Waterk, and Ran Zhou, 11/20/2016 Kaon semileptonic decays with Nf = 2+1+1 HISQ fermions and physical light-quark masses, Cornell University Library


Ruizi Li, Carleton DeTar, Douglas Doerfler, Steven Gottlieb, Ashish Jha, Dhiraj Kalamkar, Doug Toussaint, 11/3/2016 MILC staggered conjugate gradient performance on Intel KNL, Cornell University Library


Carleton DeTar, Douglas Doerfler, Steven Gottlieb, Ashish Jha, Balint Joo, Dhiraj Kalamkar, Ruizi Li, Doug Toussaint, 9/21/2016 MILC Staggered Conjugate Gradient Performance on Intel KNL, IXPUG


Ashish Jha, Vitali Morozov, Jack Deslippe, 9/19/2016 Vectorization Strategies for Intel's 2nd Generation Intel® Xeon Phi™ Architecture Codenamed Knights Landing, Argonne National Labs


Carleton DeTar, Douglas Doerfler, Steven Gottlieb, Ashish Jha, Balint Joo, Dhiraj Kalamkar, Ruizi Li, Doug Toussaint, 9/19/2016 MILC Staggered Conjugate Gradient Intel KNL, Argonne National Labs


Bingjing Zhang, A Collective Communication Layer for the Software Stack of Big Data Analytics, Doctoral Symposium. Proceedings of IEEE International Conference on Cloud Engineering (IC2E2016) Conference, April 4-8, 2016, Berlin, Germany. Collective Communication_paper.pdf


Bingjing Zhang, Peng Bo, Judy Qiu, 3/11/2016, Parallelizing Big Data Machine Learning Algorithms with Model Rotation, Semantic Scholar


Bingjing Zhang, Bo Peng, Judy Qiu, Parallel LDA Through Synchronized Communication Optimizations LDA_optimization_paper.pdf


News

We gave a 2 hour Tutorial on Harp-DAAL: A high Performance Machine Learning Framework for HPC-Cloud, at Intel® HPC Developer Conference (HPCDC) 2017 held in Sheraton Denver Downtown Hotel, Denver, Colorado, November 11-12, 2017.

Announcements

Intel hosted a two-day training session at the Indiana University-Bloomington campus on Sept. 2015. The course covered parallel computing using Intel Xeon processors. For more detailed information, click on the image to the right.
Affiliated sites Contact
Thomas Wiggins
email: wigginst(at)indiana.edu