Xiao Chen

AG Datenbanken & Software Engineering
Universitätsplatz 2, 39106, Magdeburg, G29-126
  • Xiao Chen, Nishanth Entoor Venkatarathnam, Kirity Rapuru, David Broneske, Gabriel Campero Durand, Roman Zoun, and Gunter Saake. Analysis and Comparison of Block-Splitting-Based Load Balancing Strategies for Parallel Entity Resolution. In International Conference on Information Integration and Web-based Applications & Services (iiWAS2020). ACM, November 2020. Accepted.
  • Xiao Chen. Towards Efficient and Effective Entity Resolution for High-Volume and Variable Data. PhD thesis, University of Magdeburg, November 2020.
  • Xiao Chen, Yinlong Xu, David Broneske, Gabriel Campero Durand, Roman Zoun, and Gunter Saake. Heterogeneous Committee-Based Active Learning for Entity Resolution (HeALER). In European Conference on Advances in Databases and Information Systems (ADBIS), LNCS, pages 69–85, September 2019.
  • Roman Zoun, Kay Schallert, David Broneske, Ivayla Trifonova, Xiao Chen, Robert Heyer, Dirk Benndorf, and Gunter Saake. Efficient Transformation of Protein Sequence Databases to Columnar Index Schema. In International Workshop on Biological Knowledge Discovery and Data Mining (BIOKDD-DEXA), volume 1062 of CCIS, pages 67–72. IEEE, August 2019.
  • Xiao Chen, Gabriel Campero Durand, Roman Zoun, David Broneske, Yang Li, and Gunter Saake. The Best of Both Worlds: Combining Hand-Tuned and Word-Embedding-Based Similarity Measures for Entity Resolution. In Datenbanksysteme für Business, Technologie und Web, pages 215–224, March 2019.
  • Xiao Chen, Roman Zoun, Eike Schallehn, Sravani Mantha, Kirity Rapuru, and Gunter Saake. Exploring Spark-SQL-Based Entity Resolution Using the Persistence Capability. In Beyond Databases, Architectures and Structures, pages 3–17, September 2018.
  • Xiao Chen, Kirity Rapuru, Gabriel Campero Durand, and Eike Schallehn. Performance Comparison of Three Spark-Based Implementations of Parallel Entity Resolution. In International Workshop on Big Data Management in Cloud Systems (BDMICS-DEXA), pages 76–87. Springer, September 2018.
  • Xiao Chen, Eike Schallehn, and Gunter Saake. Cloud-Scale Entity Resolution: Current State and Open Challenges. Open Journal of Big Data, 4(1):30–51, April 2018.
  • Xiao Chen. Crowdsourcing Entity Resolution: a Short Overview and Open Issues. In Proceedings of the 27th GI-Workshop Grundlagen von Datenbanken, Gommern, Germany, May 26–29, 2015, pages 72–77, 2015.
  • Xiao Chen. An Overview and Classification of Current Research on Crowdprocessing and Databases. Master's thesis, University of Magdeburg, Germany, March 2014.


  • Parallel Entity Resolution
  • Apache Spark
  • Crowdsourcing

