Speakers

Vs.subrah

KEYNOTE

V.S. Subrahmanian University of Maryland

Prof. Subrahmanian is one of the world leaders in logical reasoning with uncertainty, probabilistic logics, temporal probabilistic logics, and managing huge, heterogeneous databases with incomplete and inconsistent information, and multimedia databases. In recent years, he has developed scalable methods to apply probabilistic logic models to a wide variety of real-world scenarios. He created the field of computational cultural dynamics with a suite of novel methods to analyze the behaviors of terrorist groups and applied them to making forecasts and suggesting policies to shape behaviors of groups like Hezbollah, Lashkar-e-Taiba, and Indian Mujahideen.

Antonia gogoglou

Antonia Gogoglou Aristotle University of Thessaloniki, SignalGeneriX Ltd Cyprus

Antonia is a graduating PhD candidate at Aristotle University of Thessaloniki, Greece, and a Data Scientist at SignalGeneriX, Cyprus. Her research interests include complex network analysis in real world graphs, knowledge discovery and predictive modeling. She is a Marie Curie Rise Fellow with experience in user profiling and fraud detection for security on the Web through graph decomposition in large scale networks.

Brizan david guy

David Guy Brizan University of San Francisco / CUNY Graduate Center

David Guy Brizan is an Assistant Professor in the University of San Francisco's M.S. in Analytics program. His research areas include natural language processing, databases and social networks. Prior to joining USF, he was a researcher in the Speech Lab at Queens College, manager of analytics and product groups in the City of New York and a software engineer in companies such as IBM and McKesson.

Antonia gogoglou

Antonia Gogoglou Aristotle University of Thessaloniki, SignalGeneriX Ltd Cyprus

Antonia is a graduating PhD candidate at Aristotle University of Thessaloniki, Greece, and a Data Scientist at SignalGeneriX, Cyprus. Her research interests include complex network analysis in real world graphs, knowledge discovery and predictive modeling. She is a Marie Curie Rise Fellow with experience in user profiling and fraud detection for security on the Web through graph decomposition in large scale networks.

Michael terry

Michael Terry Lawfty

Mike is the technical co-founder of Lawfty, a data-driven system that helps injured people find legal services in their time of need. Prior to starting Lawfty in 2013, Mike co-founded a streaming video analytics platform SeeTheScene.TV, and worked for many years as a signal processing engineer for Raytheon and SGI. He has a BS and M.Eng from MIT and a minor in Psychology.

Kevin stofan

Kevin Stofan DataRobot

Kevin is a Data Scientist at DataRobot and an Adjunct Professor at Penn State University where he teaches a graduate level Geographic Information Systems (GIS) course. He has over 16 years of experience using GIS and geospatial analysis to solve real world business problems. His experience with modeling geospatial phenomena include geostatistical, spatial econometric, and point pattern analysis.

Nadia Udler

My background is in Applied Math and Computer Science. I have over 20 years of experience developing software for financial industry. Currently I teach Simulations for Financial Applications in Fordham University Graduate School of Business

Sargur Srihari Unversity at Buffalo, The State University of New York

Srihari is a SUNY Distinguished Professor in the Department of Computer Science and Engineering at the University at Buffalo, The State University of New York. He has been working in the fields of Artificial Intelligence and Machine Learning for over four decades. At present, he teaches a sequence of advanced courses on machine learning and deep learning. A laboratory that Srihari founded developed the world’s first automated system for reading handwritten postal addresses.

Patrick Hall H2O.ai

Patrick Hall is a product engineer and senior data scientist at H2O.ai where he focuses on model interpretability and works with H2o.ai customers to derive substantive business value from machine learning technologies. Patrick is also currently an adjunct professor in the Department of Decision Sciences at George Washington University, where he teaches graduate classes in data mining and machine learning. 

Thuc Tran The George Washington University

Thuc Tran is currently pursuing his PhD in Systems Engineering at George Washington University, having previously completed his MS in Computer Science at Georgia Tech. He also has an MS in Systems Engineering from GW and a BS in Information Technology from George Mason University. Professionally, he is an Engineer with the Center for Machine Learning within Capital One.

Sanjana Ramprasad

Graduated with a Master's in Computer Science with a specialization in Artificial Intelligence and a thesis on Handwriting analysis using Deep Learning.

Harish Krishnamurthy

Harish Krishnamurthy has a Masters from Northeastern University, Boston in Communication & Digital Signal Processing, EE. More than 15 years of experience in Machine Learning, Data Science & Digital Signal Processing. Work experience includes: Mitsubishi Electric Research Labs (MERL), Schlumberger Doll Research, Nokia, and MassMutual Financial Group. He has several publications and patents filed in the areas of ML. He currently leads the Data Science & ML effort at Colaberry Inc.

Michael Becker

Michael Becker is a Senior Data Scientist at Penn Medicine where he is building machine learning systems to improve patient outcomes. Michael organizes the DataPhilly Meetup group, and regularly presents at community events and conferences: http://beckerfuffle.com

Sanjana Ramprasad

Graduated with a Master's in Computer Science with a specialization in Artificial Intelligence and a thesis on Handwriting analysis using Deep Learning.

Van Lindberg Dykema Cox Smith

I am Senior Counsel at Dykema Cox Smith, with a practice focusing on intellectual property and open source matters. I specialize in the intersection of technology and law, and was in 2012 named one of "America's Top 12 Techiest Attorneys" by the American Bar Association Journal. My hobby is machine learning, which I use to data-mine the US Patent Database in search of better prior art.

Francois Dion

Founder of PYPTUG (PYthon Piedmont Triad User Group), Owner of R&D firm Dion Research LLC and Data Scientist at Inmar Inc. "Artist, Coder, Data Scientist, Entrepreneur, Hacker, Mentor, Musician, Photographer, Polyglot and Polymath"

Jim Crist Jim: Continuum Analytics, Chris and Aron: Capital One

Jim Crist holds a Bachelors and a (tentative) Masters in Mechanical Engineering from the University of Minnesota. Whilst procrastinating on his thesis, he got involved in the scientific Python community. He is currently a software developer at Continuum Analytics. Chris White holds a Ph.D. in Mathematics from UT-Austin. He is currently a Data Scientist at Capital One. He loves running and data.

Olisa Stephensbailey Booz Allen Hamilton

Olisa Stephensbailey leads the Applied Cyber Data Science functional group at Booz Allen Hamilton. Her research interests include cyberspace operations, computational linguistics, computer forensics, natural language processing, and data science.

Doug Liming

As an enterprise architect, Douglas Liming leverages his 21 years of experience to design cradle-to-grave enterprise analytic platforms. His solutions address data extraction and manipulation through predictive analytics, focusing on Hadoop and open ecosystems at the epicenter. He holds a BS in Computer Science from the University of North Carolina at Wilmington and runs Fedora 25 on his laptop.

Ria Baldevia Booz Allen Hamilton

Ria Baldevia is a researcher at Booz Allen Hamilton. Her research interests include Python, digital anthropology and ethnography, and machine learning.

Austin Taylor

Austin Taylor (www.austintaylor.io) has an extensive background in Defensive and Offensive Cyber Operations and has performed incident response for some of the world's top Fortune companies. He is the co-author to "Crash Course in Data Science for Hackers" which is taught annually at Blackhat and author of "How to Build a World Class Monitoring System for Home, Small Office, or Enterprise".

Frank La Vigne

Frank LaVigne is Chief Evangelist at DataLeader.IO, where he helps customers leverage data science in order to create smarter solutions and happier customers. He blogs regularly at FranksWorld.com and has a YouTube channel called Frank's World TV. (www.FranksWorld.TV) You can find him on Twitter at @tableteer.

Karan Saxena Research Intern, Machine and Language Learning (MALL) Lab, Indian Institute of Science (IISc), India.

Karan is a senior year, CS major undergrad, currently a research intern under Dr. Partha Talukdar at the Machine and Language learning (MALL) lab, IISc, India. He was the IAS Summer Research Fellow 2016 at the Reconfigurable and Intelligent Systems Engineering (RISE) Lab, IIT-M. Previously, he has been a GSoC 2016 student, GSoC 2017 mentor, Snapdeal S'15 Intern, and part of GSA class of 2014.

En Zyme Ad Hoc and Nimble, a consultancy

.

Timothy Chen Hyperpilot

Timothy Chen is the CTO of Hyperpilot, and also a PMC/committer on Apache Drill and Apache Mesos. Before joining Hyperpilot, Timothy was the lead engineer at Mesosphere working on container runtime and Spark on Mesos.

Jacob Schreiber Paul G. Allen School of Computer Science, University of Washington

Jacob Schreiber is a graduate student and IGERT Big Data Fellow at the University of Washington. His research focuses on the application of deep models to questions arising in genome science, typically involving massive data sets. In his spare time he is the author of pomegranate, a probabilistic modeling package for python, and a developer for sklearn, a popular python package for classical machi

Natalie Evans Harris

Most recently as a Senior Policy Advisor to the US Chief Technology Officer in the Obama Administration, Natalie Evans Harris has dedicated over 15 years to tackling our nation’s greatest challenges leveraging technology and data. Establishing communities focused on leveraging data to advance national priorities, she consistently demonstrates the power of innovation to break down silos and modernize government’s approach to public service.

Jim Klucar

Jim Klucar is the Director of Data Science at Immuta, a unified data platform for the world’s most secure organizations. After a dozen years of developing high performance radar processing techniques, in 2010 he switched to developing Hadoop-based data warehouse and analysis systems. Jim holds a BS in Electrical Engineering from Pennsylvania State University and a MS in Applied and Computational Mathematics from Johns Hopkins University.

Pramit Choudhary DataScience.com

I am a lead DataScientist/ML Practitioner at Datascience Inc. In my past life I have worked on solving problems related to predicting pre-emptive measure to avoid failure for improving operating efficiency in Oil n Gas Industry, social media analysis, recommendation engines, match-making using statistical models, fraud-detection, natural language processing and others.

Vishal Patel

Vishal Patel is a data science consultant with 15+ years of experience in advanced analytics. He has worked as a lead data scientist across several Fortune 500 clients. He holds two Master’s degrees: MS in Computer Science, and MS in Decision Sciences (emphasis on Statistics). Vishal is currently running his start-up, Derive (www.derive.io), which focuses on automated advanced analytics products and data science services.

Evan Richards

Evan has been extracting value from messy data for over five years. When he's not preaching the good word of arithmancy at small startups all over the Silicon Valley, he's petting the neighborhood cats in Oakland, Calif.

Jason Walsh Penn Medicine

Jason Walsh is a Data Engineer working on the Predictive Healthcare team at Penn Medicine. His interests include microservices architecture, automation, and the Python and Go programming languages: https://github.com/rightlag Michael Becker is a Senior Data Scientist at Penn Medicine where he is building machine learning systems to improve patient outcomes. Michael organizes the DataPhilly Meetup group, and regularly presents at community events and conferences: http://beckerfuffle.com

Dmitry Petrov

Dmitry Petrov lives in Silicon Valley and works on ML problems. He was a data scientist at Microsoft Bing and researcher at a university. He holds a Ph.D. degree in Computer Science. Now he is working on his open source project Data Version Control (DVC) which makes data science projects and ML pipeline reproducible by automatically building data dependency graph (DAG) and sharing code by Git and data by cloud storage (AWS S3, GCP) in a single DVC environment.

Anne-Marie Currie Advisory Board

Anne-Marie Currie, is a Principal Data Scientist at Advisory Board. She has worked with data in the domains of medical informatics, legal informatics, and consumer analytics. She has contributed to the design, development, and implementation of innovative technical solutions that leverage natural language processing and machine learning. She enjoys working on a diverse set of business problems. She received her Ph.D. in linguistics from the University of Texas at Austin.

Jason Walsh Penn Medicine

Jason Walsh is a Data Engineer working on the Predictive Healthcare team at Penn Medicine. His interests include microservices architecture, automation, and the Python and Go programming languages: https://github.com/rightlag

Austin Taylor

Austin Taylor (www.austintaylor.io) has an extensive background in Defensive and Offensive Cyber Operations and has performed incident response for some of the world's top Fortune companies. He is the co-author to "Crash Course in Data Science for Hackers" which is taught annually at Blackhat and author of "How to Build a World Class Monitoring System for Home, Small Office, or Enterprise".

Vishal Patel

Vishal Patel is a data science consultant with 15+ years of experience in advanced analytics. He has worked as a lead data scientist across several Fortune 500 clients. He holds two Master’s degrees: MS in Computer Science, and MS in Decision Sciences (emphasis on Statistics). Vishal is currently running his start-up, Derive (www.derive.io), which focuses on automated advanced analytics products and data science services.

Eric Schles NYU

Eric Schles is a practicing data scientist that has worked in the anti trafficking space for more than 10 years. He has worked at the local level, the state level and the national level on anti trafficking. He is also an adjunct professor at NYU where he teaches courses on applied machine learning and programming in python.

Andrew Montalenti Corporate - http://parse.ly

Andrew is the CTO of Parse.ly, a platform that makes it easy to understand and improve digital audience engagement through content-based data products. His team's work with the media industry includes powering the real-time dashboards used by staff at The Huffington Post, TechCrunch, Slate, Mashable, and many other top sites. Andrew is also an open source contributor in the Python community.

Frank La Vigne

Frank LaVigne is Chief Evangelist at DataLeader.IO, where he helps customers leverage data science in order to create smarter solutions and happier customers. He blogs regularly at FranksWorld.com and has a YouTube channel called Frank's World TV. (www.FranksWorld.TV) You can find him on Twitter at @tableteer.

Phil Roth Endgame

As a data scientist at Endgame, Phil develops data products that help security analysts find and respond to threats. This work has ranged from building a machine learning malware classifier to creating a data exploration tool for network traffic data. While earning a PhD in physics, Phil used machine learning and data collected at the South Pole to search for neutrinos from other galaxies.

Jacob Schreiber Paul G. Allen School of Computer Science, University of Washington

Jacob Schreiber is a graduate student and IGERT Big Data Fellow at the University of Washington. His research focuses on the application of deep models to questions arising in genome science, typically involving massive data sets. In his spare time he is the author of pomegranate, a probabilistic modeling package for python, and a developer for sklearn, a popular python package for classical machi

Rebecca Bilbro Bytecubed & District Data Labs

Dmitry Petrov

Dmitry Petrov lives in Silicon Valley and works on ML problems. He was a data scientist at Microsoft Bing and researcher at a university. He holds a Ph.D. degree in Computer Science. Now he is working on his open source project Data Version Control (DVC) which makes data science projects and ML pipeline reproducible by automatically building data dependency graph (DAG) and sharing code by Git and data by cloud storage (AWS S3, GCP) in a single DVC environment.

Ria Baldevia Booz Allen Hamilton

Ria Baldevia is a researcher at Booz Allen Hamilton. Her research interests include Python, digital anthropology and ethnography, and machine learning.

Benjamin Bengfort District Data Labs

Benjamin is partner at District Data Labs and core contributor to Yellowbrick, a model visualization library for machine learning. When not teaching at Georgetown, he is working to finish his PhD at the University of Maryland. Benjamin's diverse interests span from NLP to distributed computing, and he is author of Data Analytics with Hadoop and the forthcoming Applied Text Analytics with Python.

Awalin Sopan FireEye Inc

Awalin Sopan is a senior engineer at FireEye Inc. As a part of the Analysis team, she develops analytics platform used by FireEye’s SOC team for detecting cyber threats. Her background in Human-Computer Interaction and Data Visualization helps her build more engaging and efficient workflow for the analysts, using different aspects of interactive data visualization and data analysis. She has an MS in Computer Science from the University of Maryland.

Rebecca Bilbro Bytecubed & District Data Labs

Jason Walsh Penn Medicine

Jason Walsh is a Data Engineer working on the Predictive Healthcare team at Penn Medicine. His interests include microservices architecture, automation, and the Python and Go programming languages: https://github.com/rightlag

Malek Ben Salem

Rachita Chandra

Rachita Chandra is a Solutions Architect at IBM Watson health where she builds end-to-end solutions with machine learning pipelines in cognitive healthcare applications. She has experience implementing large scale, distributed machine learning algorithms. She holds a Masters in Electrical and Computer Engineering from Carnegie Mellon.

Eric Schles NYU

Eric Schles is a practicing data scientist that has worked in the anti trafficking space for more than 10 years. He has worked at the local level, the state level and the national level on anti trafficking. He is also an adjunct professor at NYU where he teaches courses on applied machine learning and programming in python.

Andrew Montalenti Corporate - http://parse.ly

Andrew is the CTO of Parse.ly, a platform that makes it easy to understand and improve digital audience engagement through content-based data products. His team's work with the media industry includes powering the real-time dashboards used by staff at The Huffington Post, TechCrunch, Slate, Mashable, and many other top sites. Andrew is also an open source contributor in the Python community.

Craig Warman

Craig Warman is a Solutions Architect for StreamSets and an organizer of the Atlanta Hadoop and Spark User Group meetups. He's worked in a variety of database/software/consulting capacities for the last 25 years after graduating from Virginia Tech with a CS degree in 1989. He's a distance runner, a motorcycle rider, and actively engaged with prison ministry in the Atlanta area.

Benjamin Bengfort District Data Labs

Benjamin is partner at District Data Labs and core contributor to Yellowbrick, a model visualization library for machine learning. When not teaching at Georgetown, he is working to finish his PhD at the University of Maryland. Benjamin's diverse interests span from NLP to distributed computing, and he is author of Data Analytics with Hadoop and the forthcoming Applied Text Analytics with Python.

Phil Roth Endgame

As a data scientist at Endgame, Phil develops data products that help security analysts find and respond to threats. This work has ranged from building a machine learning malware classifier to creating a data exploration tool for network traffic data. While earning a PhD in physics, Phil used machine learning and data collected at the South Pole to search for neutrinos from other galaxies.

Dalila Benachenhou Femvestor, Inc.

Dalila Benachenhou is currently CEO of Femvestor, Inc., a data insight consulting firm. She also teaches statistics at GWU. She built Fuzzy System trading models, developed affinity portfolio, built tools and methodology for executivies and companies profiling, over 20 years experience with text. Graduate degrees in both Statistics and Computer Science.

Ben Mabey Recursion Pharmaceuticals

Ben is a Lead Data Scientist at Recursion Pharmaceuticals where he helps discover cures for rare diseases. Prior to that he worked in the online advertising industry as a Machine Learning Engineer and has led various data science projects within the healthcare space. ​Ben earned his MS in Computer Science from the University of Utah where he studied computer graphics and machine learning. Outside of work Ben enjoys spending time with his wife and kids, trail running, and competing in triathlons.

Sarah Masud

S​arah is an engineer at Red Hat where she works on developer-oriented analytic projects. Her bachelor’s thesis under Dr Tanvir Ahmad on Topics Modeling was presented at Ninth International Conference on Contemporary Computing(Aug 2016).She is ever enthusiastic about Data Science, Women in STEM, and Open Source, and volunteers her time for the same.

Awalin Sopan FireEye Inc

Awalin Sopan is a senior engineer at FireEye Inc. As a part of the Analysis team, she develops analytics platform used by FireEye’s SOC team for detecting cyber threats. Her background in Human-Computer Interaction and Data Visualization helps her build more engaging and efficient workflow for the analysts, using different aspects of interactive data visualization and data analysis. She has an MS in Computer Science from the University of Maryland.

Mohammed Eslami Netrias, LLC

Mohammed Eslami is the Chief Data Scientist at Netrias where he establishes best practices, and develops the strategy for acquiring, managing, and applying Artificial Intelligence techniques to extract insights from multi-omics data. His research interests include the application of machine learning algorithms to biomedical research data that will lead to better diagnostics and therapeutics. Mohammed earned his Ph.D. in Electrical Engineering from the University of Maryland, served as a post-doctoral fellow at the Food and Drug Administration, and leads a team of data scientists on a Big Data analytics program at the Defense Advanced Research Projects Agency (DARPA). He is leading Netrias in applying some of the analytics developed at DARPA to collaborations with various institutes at NIH - to include National Cancer Institute (NCI).

Zachary Beaver Alphabet Inc. / Nest Labs

Zach Beaver is a data scientist at Alphabet's Nest Labs, working across the business and product organizations. He plays a hybrid data scientist/engineer role, leveraging machine learning to catch hardware fraud and to perform cross-selling, while also building data pipelines to automate these analyses. Zach earned his masters from UC Berkeley in 2015 and is former high school calculus teacher.