Here, you can find a list of my publications. A more up-to-date list can usually be found on my Google Scholar page. 

  1. Edward Raff, Michel Benaroch, Sagar Samtani and Andrew L Farris. What Do Machine Learning Researchers Mean by “Reproducible”?. In The Thirty-Ninth AAAI Conference on Artificial Intelligence. 2025. URL BibTeX

  2. John Hurwitz, Charles Nicholas and Edward Raff. Neural Normalized Compression Distance and the Disconnect Between Compression and Classification. In Machine Learning and Compression Workshop at NeurIPS 2024. December 2024. URL BibTeX

  3. Amol Khanna, Adam McCormick, Andre Nguyen, Chris Aguirre and Edward Raff. Position: Challenges and Opportunities for Differential Privacy in the US Federal Government. In 2nd Workshop on Regulatable ML at NeurIPS 2024. December 2024. URL BibTeX

  4. Chang Liu, Rebecca Saul, Yihao Sun, Edward Raff, Maya Fuchs, Townsend Southard Pantano, James Holt and Kristopher Micinski. Assemblage: Automatic Binary Dataset Construction for Machine Learning. In The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track. December 2024. URL BibTeX

  5. Rebecca Saul, Chang Liu, Noah Fleischmann, Richard J Zak, Kristopher Micinski, Edward Raff and James Holt. Is Function Similarity Over-Engineered? Building a Benchmark. In The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track. December 2024. URL BibTeX

  6. Mohammad Mahmudul Alam, Alexander Oberle, Edward Raff, Stella Biderman, Tim Oates and James Holt. A Walsh Hadamard Derived Linear Vector Symbolic Architecture. In The Thirty-eighth Annual Conference on Neural Information Processing Systems. December 2024. URL BibTeX

  7. Skyler Wu, Fred Lu, Edward Raff and James Holt. Stabilizing Linear Passive-Aggressive Online Learning with Weighted Reservoir Sampling. In The Thirty-eighth Annual Conference on Neural Information Processing Systems. December 2024. URL BibTeX

  8. James Holt and Edward Raff. Malware Bytes. In The Next Wave: Cyber Analytics Research 25. April 2024. URL PDF BibTeX

  9. Amol Khanna, Edward Raff and Nathan Inkawhich. SoK: A Review of Differentially Private Linear Models For High-Dimensional Data. In 2024 IEEE Conference on Secure and Trustworthy Machine Learning (SaTML) (). April 2024, 57-77. URL, DOI BibTeX

  10. Sagar Samtani, Edward Raff and Hyrum Anderson. Applied Machine Learning for Information Security. Digital Threats 5(1), April 2024. URL, DOI BibTeX

  11. Ethan M Rudd, David Krisiloff, Scott Coull, Daniel Olszewski, Edward Raff and James Holt. Efficient Malware Analysis Using Metric Embeddings. Digital Threats 5(1), March 2024. URL, DOI BibTeX

  12. Mohammad Mahmudul Alam, Edward Raff, Stella R Biderman, Tim Oates and James Holt. Holographic Global Convolutional Networks for Long-Range Prediction Tasks in Malware Detection. In Proceedings of The 27th International Conference on Artificial Intelligence and Statistics 238. 2024, 4042–4050. URL PDF BibTeX

  13. Edward Raff and Cynthia Matuszek. Does Starting Deep Learning Homework Earlier Improve Grades?. pages 381–396, Springer Nature Switzerland, 2024. URL, DOI BibTeX

  14. Ashley Klein, Edward Raff, Elisabeth Seamon, Lily Foley and Timothy Bussert. More Options for Prelabor Rupture of Membranes, A Bayesian Analysis. In 11th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2024. 2024.
    Best Paper Award. URL BibTeX

  15. Ryan Swope, Amol Khanna, Philip Doldo, Saptarshi Roy and Edward Raff. Feature Selection from Differentially Private Correlations. In Proceedings of the 17th ACM Workshop on Artificial Intelligence and Security (AISec'24). 2024. URL BibTeX

  16. Francis Ferraro Kasra Darvish Edward Raff and Cynthia Matuszek. Multimodal Language Learning for Object Retrieval in Low Data Regimes in the Face of Missing Modalities. Transactions on Machine Learning Research, October 2023. URL BibTeX

  17. Zheng Xin Yong, Hailey Schoelkopf, Niklas Muennighoff, Alham Fikri Aji, David Ifeoluwa Adelani, Khalid Almubarak, Saiful M Bari, Lintang Sutawika, Jungo Kasai, Ahmed Baruwa, Genta Winata, Stella Biderman, Edward Raff, Dragomir Radev and Vassilina Nikoulina. BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). July 2023, 11682–11703. URL, DOI BibTeX

  18. Catherine Ordun, Edward Raff and Sanjay Purushotham. When Visible-to-Thermal Facial GAN Beats Conditional Diffusion. In 2023 IEEE International Conference on Image Processing (ICIP) (). 2023, 181-185. DOI BibTeX

  19. Amol Khanna, Fred Lu and Edward Raff. The Challenge of Differentially Private Screening Rules. 2nd AdvML Frontiers Workshop at 40th International Conference on Machine Learning, 2023. URL BibTeX

  20. Robert J Joyce, Tirth Patel, Charles Nicholas and Edward Raff. AVScan2Vec: Feature Learning on Antivirus Scan Data for Production-Scale Malware Corpora. In Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security. 2023, 185–196. URL, DOI BibTeX

  21. Catherine Ordun, Edward Raff and Sanjay Purushotham. Vista Morph - Unsupervised Image Registration of Visible-Thermal Facial Pairs. In 2023 IEEE International Joint Conference on Biometrics (IJCB) (). 2023, 1-10. URL, DOI BibTeX

  22. Edward Raff and Andrew L Farris. A Siren Song of Open Source Reproducibility, Examples from Machine Learning. In Proceedings of the 2023 ACM Conference on Reproducibility and Replicability. 2023, 115–120. URL, DOI BibTeX

  23. Tyler LeBlond, Joseph Munoz, Fred Lu, Maya Fuchs, Elliot Zaresky-Williams, Edward Raff and Brian Testa. Probing the Transition to Dataset-Level Privacy in ML Models Using an Output-Specific and Data-Resolved Privacy Profile. In Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security. 2023, 23–33. URL, DOI BibTeX

  24. Catherine Ordun, Alexandra Cha, Edward Raff, Sanjay Purushotham, Karen Kwok, Mason Rule and James Gulley. A Generative Approach for Image Registration of Visible-Thermal (VT) Cancer Faces. pages 91–100, Springer Nature Switzerland, 2023. URL, DOI BibTeX

  25. Corey J Nolet, Divye Gala, Alex Fender, Mahesh Doijade, Joe Eaton, Edward Raff, John Zedlewski, Brad Rees and Tim Oates. cuSLINK: Single-Linkage Agglomerative Clustering on the GPU. pages 711–726, Springer Nature Switzerland, 2023. URL, DOI BibTeX

  26. Edward Raff, Mark McLean and James Holt. An Easy Rejection Sampling Baseline via Gradient Refined Proposals. IOS Press, 2023. URL, DOI BibTeX

  27. Robert J Joyce, Edward Raff, Charles Nicholas and James Holt. MalDICT: Benchmark Datasets on Malware Behaviors, Platforms, Exploitation, and Packers. Proceedings of the Conference on Applied Machine Learning in Information Security, 2023. URL BibTeX

  28. Edward Raff and James Holt. Reproducibility in Multiple Instance Learning: A Case For Algorithmic Unit Tests. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 2023. URL BibTeX

  29. Mohammad Mahmudul Alam, Edward Raff, Tim Oates and Cynthia Matuszek. DDxT: Deep Generative Transformer Models for Differential Diagnosis. Deep Generative Models for Health Workshop NeurIPS 2023, 2023. BibTeX

  30. Stella Biderman, USVSN Sai Prashanth, Lintang Sutawika, Hailey Schoelkopf, Quentin Anthony, Shivanshu Purohit and Edward Raff. Emergent and predictable memorization in large language models. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 2023. BibTeX

  31. Nora Belrose, David Schneider-Joseph, Shauli Ravfogel, Ryan Cotterell, Edward Raff and Stella Biderman. LEACE: Perfect linear concept erasure in closed form. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 2023. URL, DOI BibTeX

  32. Edward Raff, Amol Ashish Khanna and Fred Lu. Scaling Up Differentially Private LASSO Regularized Logistic Regression via Faster Frank-Wolfe Iterations. In Thirty-seventh Conference on Neural Information Processing Systems. 2023. URL BibTeX

  33. Luke E Richards, Edward Raff and Cynthia Matuszek. Measuring Equality in Machine Learning Security Defenses: A Case Study in Speech Recognition. In Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security. 2023, 161–171. URL, DOI BibTeX

  34. Amol Khanna, Fred Lu, Edward Raff and Brian Testa. Differentially Private Logistic Regression with Sparse Solutions. In Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security. 2023, 1–9. URL, DOI BibTeX

  35. Mohammad Mahmudul Alam, Edward Raff and Tim Oates. Towards Generalization in Subitizing with Neuro-Symbolic Loss using Holographic Reduced Representations. Neuro-Symbolic Learning and Reasoning in the era of Large Language Models, 2023. URL, DOI BibTeX

  36. Tirth Patel, Fred Lu, Edward Raff, Charles Nicholas, Cynthia Matuszek and James Holt. Small Effect Sizes in Malware Detection? Make Harder Train/Test Splits!. Proceedings of the Conference on Applied Machine Learning in Information Security, 2023. URL BibTeX

  37. Mohammad Mahmudul Alam, Edward Raff, Stella Biderman, Tim Oates and James Holt. Recasting self-attention with holographic reduced representations. In Proceedings of the 40th International Conference on Machine Learning. 2023. BibTeX

  38. Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika and Oskar Van Der Wal. Pythia: a suite for analyzing large language models across training and scaling. In Proceedings of the 40th International Conference on Machine Learning. 2023. BibTeX

  39. Fred Lu, Edward Raff and James Holt. A coreset learning reality check. In Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence. 2023. URL, DOI BibTeX

  40. Marcia DesJardin, Edward Raff, Angelina Stewart, Nicholas Baranco and Dimitrios Mastrogiannis. Comparison of two methods of antepartum anticoagulation: enoxaparin until scheduled labor versus transitioning to heparin. American Journal of Obstetrics and Gynecology 228(1):S531–S532, 2023. URL, DOI BibTeX

  41. Robert J Joyce, Dev Amlani, Charles Nicholas and Edward Raff. MOTIF: A Malware Reference Dataset with Ground Truth Family Labels. Computers & Security 124:102921, 2023. URL, DOI BibTeX

  42. Fred Lu, Edward Raff and Francis Ferraro. Neural Bregman Divergences for Distance Learning. In The Eleventh International Conference on Learning Representations. 2023. URL BibTeX

  43. Rebecca Saul, Mohammad Mahmudul Alam, John Hurwitz, Edward Raff, Tim Oates and James Holt. Lempel-Ziv Networks. In Proceedings on "I Can't Believe It's Not Better! - Understanding Deep Learning Through Empirical Falsification" at NeurIPS 2022 Workshops 187. 2023, 1–11. URL PDF BibTeX

  44. Niklas Muennighoff, Thomas Wang, Lintang Sutawika, Adam Roberts, Stella Biderman, Teven Le Scao, Saiful M Bari, Sheng Shen, Zheng Xin Yong, Hailey Schoelkopf, Xiangru Tang, Dragomir Radev, Alham Fikri Aji, Khalid Almubarak, Samuel Albanie, Zaid Alyafeai, Albert Webson, Edward Raff and Colin Raffel. Crosslingual Generalization through Multitask Finetuning. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2023, 15991–16111. URL, DOI BibTeX

  45. Mike Wong, Edward Raff, James Holt and Ravi Netravali. Marvolo: Programmatic Data Augmentation for Deep Malware Detection. In Machine Learning and Knowledge Discovery in Databases: Research Track: European Conference, ECML PKDD 2023, Turin, Italy, September 18–22, 2023, Proceedings, Part I. 2023, 270–285. URL, DOI BibTeX

  46. Marcia DesJardin, Edward Raff, Nicholas Baranco and Dimitrios Mastrogiannis. Cross-Sectional Survey of High-Risk Pregnant Women's Opinions on COVID-19 Vaccination. Women's Health Reports 3(1):608–616, June 2022. URL, DOI BibTeX

  47. Marcia DesJardin, Edward Raff, Nicholas Baranco and Dimitrios Mastrogiannis. Pregnant Women's Opinions on the COVID-19 Vaccination in Pregnancy [A301]. Obstetrics & Gynecology 139(1):87S–87S, May 2022. URL, DOI BibTeX

  48. Stella Biderman and Edward Raff. Fooling MOSS Detection with Pretrained Language Models. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 2022, 2933–2943. URL, DOI BibTeX

  49. Fred Lu, Joseph Munoz, Maya Fuchs, Tyler LeBlond, Elliott Zaresky-Williams, Edward Raff, Francis Ferraro and Brian Testa. A General Framework for Auditing Differentially Private Machine Learning. In Advances in Neural Information Processing Systems 35. 2022, 4165–4176. URL BibTeX

  50. Rebecca J Newbrander, Edward Raff, Katherine Frega and Mary J Cunningham. Eliminating postoperative opioid prescriptions is associated with lower long term opioid use (523). Gynecologic Oncology, 2022. URL, DOI BibTeX

  51. Robert J Joyce, Dev Amlani, Charles Nicholas and Edward Raff. MOTIF: A Large Malware Reference Dataset with Ground Truth Family Labels. In The AAAI-22 Workshop on Artificial Intelligence for Cyber Security (AICS). 2022. URL, DOI BibTeX

  52. Gaoussou Youssouf Kebe, Luke E Richards, Edward Raff, Francis Ferraro and Cynthia Matuszek. Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded Language from Percepts and Raw Speech. In AAAI. 2022. URL BibTeX

  53. Andre T Nguyen, Fred Lu, Gary Lopez Munoz, Edward Raff, Charles Nicholas and James Holt. Out of Distribution Data Detection Using Dropout Bayesian Neural Networks. In Proceedings of the 36th AAAI Conference on Artificial Intelligence. 2022. URL BibTeX

  54. Fred Lu, Francis Ferraro and Edward Raff. Continuously Generalized Ordinal Regression for Linear and Deep Models. In SIAM International Conference on Data Mining (SDM22). 2022. URL BibTeX

  55. Corey J Nolet, Divye Gala, Edward Raff, Joe Eaton, Brad Rees, John Zedlewski and Tim Oates. Semiring Primitives for Sparse Neighborhood Methods on the GPU. In MLSys Conference. 2022.
    Outstanding Paper Award (1 of 5). URL BibTeX

  56. Edward Raff. Does the Market of Citations Reward Reproducible Work?. In ML Evaluation Standards Workshop at ICLR 2022. 2022. URL, DOI BibTeX

  57. Edward Raff and Andrew L Farris. A Siren Song of Open Source Reproducibility. In ML Evaluation Standards Workshop at ICLR 2022. 2022.
    Outstanding Paper Award (1 of 5). URL, DOI BibTeX

  58. Mohammad Mahmudul Alam, Edward Raff, Tim Oates and James Holt. Deploying Convolutional Networks on Untrusted Platforms Using 2D Holographic Reduced Representations. In International Conference on Machine Learning. 2022. URL BibTeX

  59. Stella Biderman and Edward Raff. Neural Language Models are Effective Plagiarists. arXiv, 2022. URL, DOI BibTeX

  60. Katherine Crowson, Stella Biderman, Daniel Kornis, Dashiell Stander, Eric Hallahan, Louis Castricato and Edward Raff. VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance. In ECCV. 2022. URL, DOI BibTeX

  61. A T Nguyen, L E Richards, G Kebe, Edward Raff, K Darvish, F Ferraro and C Matuszek. Practical Cross-modal Manifold Alignment for Robotic Grounded Language Learning. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) . June 2021, 1613-1622. URL PDF, DOI BibTeX

  62. Robert J Joyce, Edward Raff and Charles Nicholas. Rank-1 Similarity Matrix Decomposition For Modeling Changes in Antivirus Consensus Through Time. In Proceedings of the Conference on Applied Machine Learning for Information Security. 2021. URL BibTeX

  63. Robert J Joyce, Edward Raff and Charles Nicholas. A Framework for Cluster and Classifier Evaluation in the Absence of Reference Labels. In Proceedings of the 14th ACM Workshop on Artificial Intelligence and Security (AISec '21). 2021. URL, DOI BibTeX

  64. Corey J Nolet, Victor Lafargue, Edward Raff, Thejaswi Nanditale, Tim Oates, John Zedlewski and Joshua Patterson. Bringing UMAP Closer to the Speed of Light with GPU Acceleration. In The Thirty-Fifth AAAI Conference on Artificial Intelligence. 2021. URL BibTeX

  65. Edward Raff. Research Reproducibility as a Survival Analysis. In The Thirty-Fifth AAAI Conference on Artificial Intelligence. 2021. URL BibTeX

  66. James Holt and Edward Raff. RaNdOm Is RoBuSt: Using Randomness to Make Classifiers Resistant to Attack. The Next Wave 23(1):60–59, 2021. URL BibTeX

  67. Xavier Bouthillier, Pierre Delaunay, Mirko Bronzi, Assya Trofimov, Brennan Nichyporuk, Justin Szeto, Naz Sepah, Edward Raff, Kanika Madan, Vikram Voleti, Samira Ebrahimi Kahou, Vincent Michalski, Dmitriy Serdyuk, Tal Arbel, Chris Pal, Gaël Varoquaux and Pascal Vincent. Accounting for Variance in Machine Learning Benchmarks. In Machine Learning and Systems (MLSys). 2021. URL BibTeX

  68. Edward Raff. Exact Acceleration of K-Means ++ and K-Means. In Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21. 2021, 2928–2935. URL, DOI BibTeX

  69. Catherine Ordun, Edward Raff and Sanjay Purushotham. Generating Thermal Human Faces for Physiological Assessment Using Thermal Sensor Auxiliary Labels. In ICIP. 2021. URL BibTeX

  70. Andre T Nguyen, Edward Raff, Charles Nicholas and James Holt. Leveraging Uncertainty for Improved Static Malware Detection Under Extreme False Positive Constraints. In IJCAI-21 1st International Workshop on Adaptive Cyber Defense. 2021. URL BibTeX

  71. Edward Raff, William Fleshman, Richard Zak, Hyrum S Anderson, Bobby Filar and Mark McLean. Classifying Sequences of Extreme Length with Constant Memory Applied to Malware Detection. In The Thirty-Fifth AAAI Conference on Artificial Intelligence. 2021. URL BibTeX

  72. Luke E Richards, André Nguyen, Ryan Capps, Steven Forsythe, Cynthia Matuszek and Edward Raff. Adversarial Transfer Attacks With Unknown Data and Class Overlap. In Proceedings of the 14th ACM Workshop on Artificial Intelligence and Security (AISec '21). 2021. URL, DOI BibTeX

  73. Gaoussou Youssouf Kebe, Padraig Higgins, Patrick Jenkins, Kasra Darvish, Ryan Barron, John Winder, Don Engel, Edward Raff, Francis Ferraro, Cynthia Matuszek and Booz Allen Hamilton. A Spoken Language Dataset of Descriptions for Speech-Based Grounded Language Learning. In NeurIPS. 2021. URL BibTeX

  74. Ashwinkumar Ganesan, Hang Gao, Sunil Gandhi, Edward Raff, Tim Oates, James Holt and Mark McLean. Learning with Holographic Reduced Representations. In Advances in Neural Information Processing Systems. 2021. URL BibTeX

  75. Catherine Ordun, Alexandra N Cha, Edward Raff, Byron Gaskin, Alex Hanson, Mason Rule, Sanjay Purushotham and James L Gulley. Intelligent Sight and Sound : A Chronic Cancer Pain Dataset. In NeurIPS. 2021. URL BibTeX

  76. John Boutsikas, Maksim E Eren, Charles Varga, Edward Raff, Cynthia Matuszek and Charles Nicholas. Evading Malware Classifiers via Monte Carlo Mutant Feature Discovery. In Malware Technical Exchange Meeting. 2021. URL BibTeX

  77. Edward Raff, Bobby Filar and James Holt. Getting Passive Aggressive About False Positives: Patching Deployed Malware Detectors. In 2020 International Conference on Data Mining Workshops (ICDMW). November 2020, 506–515. URL, DOI BibTeX

  78. Wenbin Zhang, Mingli Zhang, Ji Zhang, Zhen Liu, Zhiyuan Chen, Jianwu Wang, Edward Raff and Enza Messina. Flexible and Adaptive Fairness-aware Learning in Non-stationary Data Streams. In 2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI). November 2020, 399–406. URL, DOI BibTeX

  79. Edward Raff, Charles Nicholas and Mark McLean. A New Burrows Wheeler Transform Markov Distance. In The Thirty-Fourth AAAI Conference on Artificial Intelligence. 2020, 5444–5453. URL, DOI BibTeX

  80. Arash Rahnama, Andre T Nguyen and Edward Raff. Robust Design of Deep Neural Networks against Adversarial Attacks based on Lyapunov Theory. In The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2020, 8178–8187. URL BibTeX

  81. Catherine Ordun, Sanjay Purushotham and Edward Raff. Exploratory Analysis of Covid-19 Tweets using Topic Modeling, UMAP, and DiGraphs. In epiDAMIK 2020: 3rd epiDAMIK ACM SIGKDD International Workshop on Epidemiology meets Data Mining and Knowledge Discovery. 2020. URL BibTeX

  82. Edward Raff and Charles Nicholas. A Survey of Machine Learning Methods and Challenges for Windows Malware Classification. In NeurIPS 2020 Workshop: ML Retrospectives, Surveys & Meta-Analyses (ML-RSA). 2020.
    Best Paper Award. URL BibTeX

  83. Patrick Jenkins, Rishabh Sachdeva, Gaoussou Youssouf Kebe, Padraig Higgins, Kasra Darvish, Edward Raff, Don Engel, John Winder, Francisco Ferraro and Cynthia Matuszek. Presentation and Analysis of a Multimodal Dataset for Grounded LanguageLearning. arXiv, 2020. URL BibTeX

  84. Maksim Ekin Eren, Nick Solovyev, Edward Raff, Charles Nicholas and Ben Johnson. COVID-19 Kaggle Literature Organization. In Proceedings of the ACM Symposium on Document Engineering 2020. 2020, 1–4. URL, DOI BibTeX

  85. Edward Raff, Richard Zak, Gary Lopez Munoz, William Fleming, Hyrum S Anderson, Bobby Filar, Charles Nicholas and James Holt. Automatic Yara Rule Generation Using Biclustering. In 13th ACM Workshop on Artificial Intelligence and Security (AISec'20). 2020.
    Best Paper Award. URL, DOI BibTeX

  86. Catherine Ordun, Edward Raff and Sanjay Purushotham. The Use of AI for Thermal Emotion Recognition: A Review of Problems and Limitations in Standard Design and Data. In AAAI FSS-20: Artificial Intelligence in Government and Public Sector. 2020. URL BibTeX

  87. Nisha Pillai, Edward Raff, Francis Ferraro and Cynthia Matuszek. Sampling Approach Matters: Active Learning for Robotic Language Acquisition. In 2020 IEEE International Conference on Big Data (Big Data). 2020. URL BibTeX

  88. Jared Sylvester and Edward Raff. Trimming the Thorns of AI Fairness Research. Data Engineering 43(4):74–84, 2020. URL BibTeX

  89. Andre T Nguyen, Edward Raff and Aaron Sant-Miller. Would a File by Any Other Name Seem as Malicious?. In 2019 IEEE International Conference on Big Data (Big Data). December 2019, 1322–1331. URL, DOI BibTeX

  90. Ashley Klein, Julio J Jauregui, Edward Raff, Frank R Henn, Ashfaq S Hasan and Mohit Gilotra. Early outcomes and complications of obese patients undergoing shoulder arthroplasty: A meta-analysis. Journal of Clinical Orthopaedics and Trauma, September 2019. URL, DOI BibTeX

  91. William Fleshman, Edward Raff, Jared Sylvester, Steven Forsyth and Mark McLean. Non-Negative Networks Against Adversarial Attacks. AAAI-2019 Workshop on Artificial Intelligence for Cyber Security, 2019. URL BibTeX

  92. Andre T Nguyen and Edward Raff. Adversarial Attacks, Regression, and Numerical Stability Regularization. In The AAAI-19 Workshop on Engineering Dependable and Secure Machine Learning Systems. 2019. URL BibTeX

  93. Edward Raff, Shannon Lantzy and Ezekiel J Maier. Dr. AI, Where Did You Get Your Degree?. In Artificial Intelligence in Health. 2019, 76–83. URL BibTeX

  94. E Raff and J Sylvester. Gradient reversal against discrimination: A fair neural network learning approach. In Proceedings - 2018 IEEE 5th International Conference on Data Science and Advanced Analytics, DSAA 2018. 2019. URL, DOI BibTeX

  95. Edward Raff, Joe Aurelio and Charles Nicholas. PyLZJD: An Easy to Use Tool for Machine Learning. In Proceedings of the 18th Python in Science Conference. 2019, 97–102. URL, DOI BibTeX

  96. Arash Rahnama, Andre T Nguyen and Edward Raff. Connecting Lyapunov Control Theory to Adversarial Attacks. In Proceedings ofAdvML'19: Workshop on Adversarial Learning Methods for Machine Learning and Data Mining at KDD. 2019. URL BibTeX

  97. Andre T Nguyen and Edward Raff. Heterogeneous Relational Kernel Learning. In 5th KDDWorkshop on Mining and Learning from Time Series. 2019. URL BibTeX

  98. Andre T Nguyen, Julia Lien, Edward Raff and Sumiko R Mekaru. Improved Automatic Pharmacovigilance : An Enhancement to the MedWatcher Social System for Monitoring Adverse Events. In Epidemiology Meets Data Mining and Knowledge Discovery Workshop at KDD. 2019. URL, DOI BibTeX

  99. Edward Raff, Jared Sylvester, Steven Forsyth and Mark McLean. Barrage of Random Transforms for Adversarially Robust Defense. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2019, 6528–6537. URL BibTeX

  100. Edward Raff. A Step Toward Quantifying Independently Reproducible Machine Learning Research. In NeurIPS. 2019. URL BibTeX

  101. Edward Raff, William Fleming, Richard Zak, Hyrum Anderson, Bill Finlayson, Charles K Nicholas, Mark Mclean, William Fleming, Charles K Nicholas, Richard Zak and Mark Mclean. KiloGrams: Very Large N-Grams for Malware Classification. In Proceedings of KDD 2019 Workshop on Learning and Mining for Cybersecurity (LEMINCS'19). 2019. URL BibTeX

  102. Edward Raff and Mark McLean. Hash-Grams On Many-Cores and Skewed Distributions. In 2018 IEEE International Conference on Big Data (Big Data). December 2018, 158–165. URL, DOI BibTeX

  103. Edward Raff and Jared Sylvester. Linear Models with Many Cores and CPUs: A Stochastic Atomic Update Scheme. In 2018 IEEE International Conference on Big Data (Big Data). December 2018, 65–73. URL, DOI BibTeX

  104. Edward Raff. Neural Fingerprint Enhancement. In 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA). December 2018, 118–124. URL, DOI BibTeX

  105. Edward Raff, Jon Barker, Jared Sylvester, Robert Brandon, Bryan Catanzaro and Charles Nicholas. Malware Detection by Eating a Whole EXE. In AAAI Workshop on Artificial Intelligence for Cyber Security. October 2018. URL BibTeX

  106. William Fleshman, Edward Raff, Richard Zak, Mark McLean and Charles Nicholas. {Static Malware Detection & Subterfuge: Quantifying the Robustness of Machine Learning and Current Anti-Virus}. In 2018 13th International Conference on Malicious and Unwanted Software (MALWARE). October 2018, 1–10.
    Best Paper Award. URL, DOI BibTeX

  107. Edward Raff and Charles K Nicholas. Lempel-Ziv Jaccard Distance, an effective alternative to ssdeep and sdhash. Digital Investigation, February 2018. URL, DOI BibTeX

  108. Edward Raff, Jared Sylvester and Steven Mills. Fair Forests: Regularized Tree Induction to Minimize Model Bias. In AAAI / ACM conference on Artificial Intelligence, Ethics, and Society. 2018. URL BibTeX

  109. Edward Raff and Charles Nicholas. Toward Metric Indexes for Incremental Insertion and Querying. arXiv, 2018. URL BibTeX

  110. Edward Raff and Charles Nicholas. Hash-Grams: Faster N-Gram Features for Classification and Malware Detection. In Proceedings of the ACM Symposium on Document Engineering 2018. 2018. URL, DOI BibTeX

  111. Edward Raff, Shannon Lantzy and Ezekiel Maier. Dr. AI, Where did you get your degree?. Proceedings of the First Joint Workshop on AI in Health organized as part of the Federated AI Meeting (FAIM 2018) 2142:204–207, 2018. URL BibTeX

  112. Jared Sylvester and Edward Raff. What About Applied Fairness?. In Machine Learning: The Debates (ML-D) organized as part of the Federated AI Meeting (FAIM 2018). 2018. URL BibTeX

  113. Edward Raff and Jared Sylvester. Gradient Reversal Against Discrimination. In Proceedings ofthe 5th Workshop on Fairness, Accountability and Transparency in Machine Learning. 2018. URL BibTeX

  114. Edward Raff. Growing and Retaining AI Talent for the United States Government. In AAAI FSS-18: Artificial Intelligence in Government and Public Sector. 2018. URL BibTeX

  115. Edward Raff, Jared Sylvester and Charles Nicholas. Engineering a Simplified 0-Bit Consistent Weighted Sampling. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 2018, 1203–1212. URL, DOI BibTeX

  116. Josh Sullivan, Josh Elliot, Kirsten Lloyd and Edward Raff. My Fair Data: How the Government Can Limit Bias in Artificial Intelligence. 2018. URL BibTeX

  117. Richard Zak, Edward Raff and Charles Nicholas. What can N-grams learn for malware detection?. In 2017 12th International Conference on Malicious and Unwanted Software (MALWARE). October 2017, 109–118. URL, DOI BibTeX

  118. Edward Raff. JSAT: Java Statistical Analysis Tool, a Library for Machine Learning. Journal of Machine Learning Research 18(23):1–5, 2017. URL BibTeX

  119. Edward Raff and Charles Nicholas. An Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '17. 2017, 1007–1015. URL, DOI BibTeX

  120. Edward Raff and Charles Nicholas. Malware Classification and Class Imbalance via Stochastic Hashed LZJD. In Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security. 2017, 111–120. URL, DOI BibTeX

  121. Edward Raff, Jared Sylvester and Charles Nicholas. Learning the PE Header, Malware Detection with Minimal Domain Knowledge. In Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security. 2017, 121–132. URL, DOI BibTeX

  122. Edward Raff, Richard Zak, Russell Cox, Jared Sylvester, Paul Yacci, Rebecca Ward, Anna Tracy, Mark McLean and Charles Nicholas. An investigation of byte n-gram features for malware classification. Journal of Computer Virology and Hacking Techniques, September 2016. URL, DOI BibTeX