Crowder: Crowdsourcing entity resolution J Wang, T Kraska, MJ Franklin, J Feng arXiv preprint arXiv:1208.1927, 2012 | 754 | 2012 |
Data cleaning: Overview and emerging challenges X Chu, IF Ilyas, S Krishnan, J Wang Proceedings of the 2016 international conference on management of data, 2201 …, 2016 | 710 | 2016 |
Crowdsourced data management: A survey G Li, J Wang, Y Zheng, MJ Franklin IEEE Transactions on Knowledge and Data Engineering 28 (9), 2296-2319, 2016 | 400 | 2016 |
Activeclean: Interactive data cleaning for statistical modeling S Krishnan, J Wang, E Wu, MJ Franklin, K Goldberg Proceedings of the VLDB Endowment 9 (12), 948-959, 2016 | 320 | 2016 |
Can we beat the prefix filtering? An adaptive framework for similarity join and search J Wang, G Li, J Feng Proceedings of the 2012 ACM SIGMOD international conference on management of …, 2012 | 297 | 2012 |
Leveraging transitive relations for crowdsourced joins J Wang, G Li, T Kraska, MJ Franklin, J Feng Proceedings of the 2013 ACM SIGMOD International Conference on Management of …, 2013 | 268 | 2013 |
Pass-join: A partition-based method for similarity joins G Li, D Deng, J Wang, J Feng arXiv preprint arXiv:1111.7171, 2011 | 253 | 2011 |
QASCA: A quality-aware task assignment system for crowdsourcing applications Y Zheng, J Wang, G Li, R Cheng, J Feng Proceedings of the 2015 ACM SIGMOD international conference on management of …, 2015 | 233 | 2015 |
Fast-join: An efficient method for fuzzy token matching based string similarity join J Wang, G Li, J Fe 2011 IEEE 27th International Conference on Data Engineering, 458-469, 2011 | 214 | 2011 |
Trie-join: Efficient trie-based string similarity joins with edit-distance constraints J Wang, J Feng, G Li Proceedings of the VLDB Endowment 3 (1-2), 1219-1230, 2010 | 209 | 2010 |
Entity matching: How similar is similar J Wang, G Li, JX Yu, J Feng Proceedings of the VLDB Endowment 4 (10), 622-633, 2011 | 177 | 2011 |
A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data J Wang, S Krishnan, MJ Franklin, K Goldberg, T Milo, T Kraska SIGMOD, 2014 | 173 | 2014 |
Massjoin: A mapreduce-based method for scalable string similarity joins D Deng, G Li, S Hao, J Wang, J Feng 2014 IEEE 30th International Conference on Data Engineering, 340-351, 2014 | 164 | 2014 |
Towards dependable data repairing with fixing rules J Wang, N Tang Proceedings of the 2014 ACM SIGMOD international conference on Management of …, 2014 | 156 | 2014 |
Are we ready for learned cardinality estimation? X Wang, C Qu, W Wu, J Wang, Q Zhou arXiv preprint arXiv:2012.06743, 2020 | 126 | 2020 |
Crowdsourced data management: Overview and challenges G Li, Y Zheng, J Fan, J Wang, R Cheng Proceedings of the 2017 ACM international conference on Management of Data …, 2017 | 123 | 2017 |
Learning accurate kinematic control of cable-driven surgical robots using data cleaning and gaussian process regression J Mahler, S Krishnan, M Laskey, S Sen, A Murali, B Kehoe, S Patil, ... 2014 IEEE international conference on automation science and engineering …, 2014 | 96 | 2014 |
Activeclean: An interactive data cleaning framework for modern machine learning S Krishnan, MJ Franklin, K Goldberg, J Wang, E Wu Proceedings of the 2016 international conference on management of data, 2117 …, 2016 | 91 | 2016 |
Trie-join: a trie-based method for efficient string similarity joins J Feng, J Wang, G Li The VLDB Journal 21, 437-461, 2012 | 86 | 2012 |
Clamshell: Speeding up crowds for low-latency data labeling D Haas, J Wang, E Wu, MJ Franklin arXiv preprint arXiv:1509.05969, 2015 | 84 | 2015 |