Quick Links

Benjamin Fung

Professor Benjamin Fung


Associate Professor

514.398.3360
ben [dot] fung [at] mcgill [dot] ca (Email)

Research website

Research lab: Data Mining & Security (DMaS) Lab


Abridged Curriculum Vitae


Education

PhD (Computing Science), Simon Fraser University
MSc (Computing Science), Simon Fraser University
BSc (Computing Science), Simon Fraser University


Designation

P.Eng. (Software Engineering), Professional Engineers Ontario;
ACM Senior Member;
IEEE Senior Member


Current professional activities

  • Research Scientist, National Cyber-Forensics and Training Alliance Canada (NCFTA Canada)
  • Affiliate Associate Professor, Concordia Institute for Information Systems Engineering (CIISE), Concordia University
  • PC Member, ACM SIGKDD, VLDB, IEEE ICDE, and IEEE ICDM

Teaching & supervision

Teaching

PhD supervision


Research

Research interests

  • Data mining and databases
  • Information security and privacy
  • Information sharing and integration
  • Cloud computing

Research applications

  • Health informatics
  • Crime investigation
  • Authorship analysis
  • Code clone detection
  • Passenger flow analysis
  • Building occupants' behaviour analysis
  • Cross-cultural study in fashion communication

Selected research activities

2013-2018 NSERC Discovery Grants: Privacy-Preserving Data Publishing for Health Data Mining (Principal investigator)
2013-2016 DND/NSERC Research Partnership Project (DNDPJ): Software Fingerprinting for Automated Malicious Code Analysis (Co-investigator)
2012-2015 DND/NSERC Research Partnership Project (DNDPJ)
2011-2014 FQRNT Team Research Project: Towards a Unified Approach to Detecting, Analyzing, Mitigating, and Investigating Botnets (Co-investigator)
2012-2013 Defence Research and Development Canada (DRDC): Semantic Clone Search (Principal investigator)
2010-2013 NSERC Strategic Project Grants: Security and Privacy of User-Generated Data for Personalized Cloud Computing Services (Co-investigator)
2010-2013 NSERC Discovery Grants:  Privacy-Preserving RFID Systems for Data Analysis (Principal investigator)
2011-2012 Defence Research and Development Canada (DRDC): Code Clone Search (Principal investigator)
2012 Defence Research and Development Canada (DRDC): Centre for Security Science (CSS) - Public Security Technical Program (PSTP): Advanced Analytics and Darknet Space Analysis for Predictive Indicators of Cyber Threat Activity (Deputy study project manager)
2010-2012 FQRNT New Researcher Start-up Program: Privacy-Preserving Data Mining for Cybercrime Investigations (Principal investigator)
2011 Defence Research and Development Canada (DRDC): Survey of Code Clone Detection (Principal investigator)
2010-2011 Concordia Seed Funding Program (Team): Data Mining Techniques for Cyber Security Response Systems (Principal investigator)
2009-2010 National Cyber-Forensics Training Alliance Canada (NCFTA Cda): Text Mining for Cybercrime Investigation and Detection (Principal investigator)
2008-2009 Concordia Seed Funding Program (Individual): Techniques for Combating and Mitigating Online Identity Theft (Co-investigator)
2007-2009 Concordia ENCS Faculty Start-up Funds: Privacy-Preserving RFID Systems for Data Analysis (Principal investigator)

Selected publications

2013 and in press

  1. N. Mohammed, D. Alhadidi, B. C. M. Fung, and M. Debbabi. Secure two-party differentially private data release for vertically-partitioned data. IEEE Transactions on Dependable and Secure Computing (TDSC), in press. IEEE Computer Society. [ISI impact factor in 2012: 1.059, 5-year: 1.576]
  2. S. Li, K. Nahar, and B. C. M. Fung. Product customization of tablet computers based on the information of online reviews by customers. Journal of Intelligent Manufacturing (JIM), in press. Springer. [ISI impact factor in 2012: 1.278, 5-year: 2.162]
  3. A. R. M. A. Basher and B. C. M. Fung. Analyzing topics and authors in chat logs for crime investigation. Knowledge and Information Systems (KAIS): An International Journal, in press. Springer. [ISI impact factor in 2011: 2.225, 5-year: 2.151]
  4. S. Goryczka, L. Xiong, and B. C. M. Fung. m-privacy for collaborative data publishing. IEEE Transactions on Knowledge and Data Engineering (TKDE), in press. IEEE Computer Society. [ISI impact factor in 2012: 1.892, 5-year: 2.426]
  5. G. G. Dagher and B. C. M. Fung. Subject-based semantic document clustering for digital forensic investigations. Data & Knowledge Engineering (DKE), 86:224-241, July 2013. Elsevier. [CTV Interview | ISI impact factor in 2012: 1.519, 5-year: 1.710]
  6. Z. Yu, B. C. M. Fung, and F. Haghighat. Extracting knowledge from building-related data - a data mining framework. Building Simulation (BUIL): An International Journal, 6(2):207-222, June 2013. Springer. [ISI impact factor in 2011: 0.815, 5-year: 0.800]
  7. N. Mohammed, X. Jiang, R. Chen, B. C. M. Fung, and L. Ohno-Machado. Privacy-preserving heterogeneous health data sharing. Journal of the American Medical Informatics Association (JAMIA), 20(3):462-469, May 2013. BMJ. [ISI impact factor in 2011: 3.609, 5-year: 4.329]
  8. R. Chen, B. C. M. Fung, N. Mohammed, B. C. Desai, and K. Wang. Privacy-preserving trajectory data publishing by local suppression. Information Sciences (INS): Special Issue on Data Mining for Information Security, 231:83-97, May 2013. Elsevier. [ISI impact factor in 2012: 3.643, 5-year: 3.676]
  9. F. Iqbal, H. Binsalleeh, B. C. M. Fung, and M. Debbabi. A unified data mining solution for authorship analysis in anonymous textual communications. Information Sciences (INS): Special Issue on Data Mining for Information Security, 231:98-112, May 2013. Elsevier. [ISI impact factor in 2012: 3.643, 5-year: 3.676 | the research results were reported by media worldwide]

2012

  1. R. Chen, B. C. M. Fung, B. C. Desai, and N. M. Sossou. Differentially private transit data publication: a case study on the Montreal transportation system. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pages 213-221, Beijing, China: ACM Press, August 2012. [full paper | acceptance ratio: 17.6% = 133/755]
  2. B. C. M. Fung, T. Trojer, P. C. K. Hung, L. Xiong, K. Al-Hussaeni, and R. Dssouli. Service-oriented architecture for high-dimensional private data mashup. IEEE Transactions on Services Computing (TSC), 5(3):373-386, July-September 2012. IEEE Computer Society. [ISI impact factor in 2011: 1.468 | the Spotlight Paper for the July-September 2012 issue]
  3. D. Alhadidi, N. Mohammed, B. C. M. Fung, and M. Debbabi. Secure distributed framework for achieving ε-differential privacy. In Proceedings of the 12th Privacy Enhancing Technologies Symposium (PETS), LNCS 7834, pages 120-139, Vigo, Spain: Springer-Verlag, July 2012. [full paper | acceptance ratio: 22.2% = 16/72]
  4. Z. Yu, F. Haghighat, B. C. M. Fung, and L. Zhou. A novel methodology for knowledge discovery through mining associations between building operational data. Energy and Buildings (ENB), 47:430-440, April 2012. Elsevier. [ISI impact factor in 2011: 2.386, 5-year: 2.809]
  5. R. Al-Zaidy, B. C. M. Fung, A. M. Youssef, and F. Fortin. Mining criminal networks from unstructured text documents. Digital Investigation (DIIN), 8(3-4):147-160, February 2012. Elsevier. [ISI impact factor in 2012: 0.630, 5-year: 0.768]

2011

  1. Z. Yu, F. Haghighat, B. C. M. Fung, E. Morofsky, and H. Yoshino. A methodology for identifying and improving occupant behavior in residential buildings. Energy, 36(11):6596-6608, November 2011. Elsevier. [ISI impact factor in 2012: 3.651, 5-year: 4.107]
  2. N. Mohammed, B. C. M. Fung, and M. Debbabi. Anonymity meets game theory: secure data integration with malicious participants. Very Large Data Bases Journal (VLDBJ), 20(4):567-588, August 2011. Springer Berlin / Heidelberg. [ISI impact factor in 2009: 4.517, 5-year: 6.987]
  3. R. Chen, N. Mohammed, B. C. M. Fung, B. C. Desai, and L. Xiong. Publishing set-valued data via differential privacy. The Proceedings of the VLDB Endowment (PVLDB), 4(11):1087-1098, August 2011. VLDB Endowment. [research track full paper | this journal paper was presented at the 37th International Conference of Very Large Data Bases (VLDB 2011) | acceptance ratio: 18.1% = 100/553]
  4. N. Mohammed, R. Chen, B. C. M. Fung, and P. S. Yu. Differentially private data release for data mining. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pages 493-501, San Diego, CA: ACM Press, August 2011. [full paper | acceptance ratio: 7.8% = 56/714]
  5. Z. Yu, B. C. M. Fung, F. Haghighat, H. Yoshino, and E. Morofsky. A systematic procedure to study the influence of occupant behavior on building energy consumption. Energy and Buildings (ENB), 43(6):1409-1417, June 2011. Elsevier. [ISI impact factor in 2012: 2.679, 5-year: 3.254]

2010

  1. N. Mohammed, B. C. M. Fung, P. C. K. Hung, and C. Lee. Centralized and distributed anonymization for high-dimensional healthcare data. ACM Transactions on Knowledge Discovery from Data (TKDD), 4(4):18:1-18:33, October 2010. ACM Press. [ISI impact factor in 2012: 1.676]
  2. F. Iqbal, H. Binsalleeh, B. C. M. Fung, and M. Debbabi. Mining writeprints from anonymous e-mails for forensic investigation. Digital Investigation (DIIN), 7(1-2):56-64, October 2010. Elsevier. [ISI impact factor in 2010: 0.836, 5-year: 1.043]
  3. Z. Yu, F. Haghighat, B. C. M. Fung, and H. Yoshino. A decision tree method for building energy demand modeling. Energy and Buildings (ENB), 42(10):1637-1646, October 2010. Elsevier. [ISI impact factor in 2012: 2.679, 5-year: 3.254]
  4. B. C. M. Fung, K. Wang, A. W.-C. Fu, and P. S. Yu. Introduction to Privacy-Preserving Data Publishing: Concepts and Techniques, ser. Data Mining and Knowledge Discovery. 376 pages, Chapman & Hall/CRC, August 2010. [ISBN: 9781420091489]
  5. B. C. M. Fung, K. Wang, R. Chen, and P. S. Yu. Privacy-preserving data publishing: a survey of recent developments. ACM Computing Surveys (CSUR), 42(4):14:1-14:53, June 2010. ACM Press. [ISI impact factor in 2010: 8.000, 5-year: 10.910]

2009

  1. T. Trojer, B. C. M. Fung, and P. C. K. Hung. Service-oriented architecture for privacy-preserving data mashup. In Proceedings of the 7th IEEE International Conference on Web Services (ICWS), pages 767-774, Los Angeles, CA: IEEE Computer Society Press, July 2009. [industrial track full paper | acceptance ratio: 18% = 61/339]
  2. N. Mohammed, B. C. M. Fung, P. C. K. Hung, and C. Lee. Anonymizing healthcare data: a case study on the blood transfusion service. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pages 1285-1294, Paris, France: ACM Press, June 2009. [industrial track full paper | video presentation | acceptance ratio: 10% | best student paper award]
  3. B. C. M. Fung, K. Wang, L. Wang, and P. C. K. Hung. Privacy-preserving data publishing for cluster analysis. Data & Knowledge Engineering (DKE), 68(6):552-575, June 2009. Elsevier. [ISI impact factor in 2009: 1.745, 5-year: 2.036]
  4. N. Mohammed, B. C. M. Fung, K. Wang, and P. C. K. Hung. Privacy-preserving data mashup. In Proceedings of the 12th International Conference on Extending Database Technology (EDBT), pages 228-239, Saint-Petersburg, Russia: ACM Press, March 2009. [research track full paper]

2008 and before

  1. F. Iqbal, R. Hadjidj, B. C. M. Fung, and M. Debbabi. A novel approach of mining write-prints for authorship attribution in e-mail forensics. Digital Investigation (DIIN), 5(1):S42-S51. September 2008. Elsevier. [ISI impact factor in 2008: 0.961 | this journal paper was presented at the 8th DFRWS]
  2. B. C. M. Fung, K. Wang, and P. S. Yu. Anonymizing classification data for privacy preservation. IEEE Transactions on Knowledge and Data Engineering (TKDE), 19(5):711-725, May 2007. IEEE Computer Society. [ISI impact factor in 2009: 2.285, 5-year: 3.691]
  3. K. Wang, B. C. M. Fung, and P. S. Yu. Handicapping attacker's confidence: an alternative to k-anonymization. Knowledge and Information Systems (KAIS): An International Journal, 11(3):345-368, April 2007. Springer-Verlag. [ISI impact factor in 2009: 2.211, 5-year: 2.302]
  4. K. Wang and B. C. M. Fung. Anonymizing sequential releases. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), pages 414-423, Philadelphia, PA: ACM Press, August 2006. [research track full paper | research track acceptance ratio: 10.9% = 50/457]
  5. B. C. M. Fung, K. Wang, and M. Ester. Hierarchical document clustering using frequent itemsets. In Proceedings of the 3rd SIAM International Conference on Data Mining (SDM), pages 59-70, San Francisco, CA: SIAM, May 2003. [full paper | acceptance ratio: 19.8% = 21/106]