outreach

Teaching, conferences, interviews, meetups, etc

This is a curated archive of my engagements in teaching, conferences, interviews, meetups, and other related events. After 2015 my work changed and so did my involvement in these activities (and to be fair: most of this work was extracurricular, and becoming a parent for the first time in 2014 did not leave much extra space). I’m still fondly looking back at this work though, and am sure that some time in the future I will do more of it again.


Cyber Security Sessions, 2015

Organiser

Meetup about current research around network anomaly detection. Talks by:

  • Rick Hofstede (PhD student, University of Twente) “Challenges and Recent Advances in Flow-based Intrusion Detection
  • Ömer Yüksel (PhD student, Eindhoven University of Technology): “Combining Anomaly-based Detection Methods with Deep Protocol Inspection

Norvig Web Data Science Award, 2014 edition

Review committee member

The Norvig Web Data Science Award is an award for students and researchers studying at or employed by a research institute or university in the Netherlands. It is a challenge, sponsored by SURFsara, Common Crawl and Github, in which participants show what they can do with the Common Crawl dataset - a snapshot of a large part of the web - using SURFsara’s Hadoop service to provide big data compute power. The challenge is named after Peter Norvig, Research Director at Google and author of the seminal textbook Artificial Intelligence: A modern Approach.

2014 Winner: “ImTrends - The Trends of Images in World Wide Web” (by Feng Wang, Hong Huang & Vincent Gong, MSc students at Delft University of Technology)

2014 Committee: Peter Norvig, Jimmy Lin, Arjen de Vries, Evert Lammerts


Hadoop Summit Europe 2014

Chair of the Future of Hadoop track

The Future of Hadoop track included talks about what’s next for large-scale data storage and processing with Hadoop and friends, beyond the state of the art. Read the interview here (archived).


Norvig Web Data Science Award, 2013 edition

Organiser, review committee member

The Norvig Web Data Science Award is an award for students and researchers studying at or employed by a research institute or university in the Netherlands. It is a challenge, sponsored by SURFsara, Common Crawl and Github, in which participants show what they can do with the Common Crawl dataset - a snapshot of a large part of the web - using SURFsara’s Hadoop service to provide big data compute power. The challenge is named after Peter Norvig, Research Director at Google and author of the seminal textbook Artificial Intelligence: A modern Approach.

2013 Winner: “Traitor - associating Concepts using the World Wide Web” (by Lesley Wevers, Oliver Jundt, and Wanno Drijfhout, MSc students at the University of Twente)

2013 Committee: Peter Norvig, Ricardo Baeza-Yates, Hilary Mason, Jimmy Lin, Evert Lammerts


Hadoop Summit Europe 2013

Chair of the Operating Hadoop track

The Operating Hadoop track was all about large-scale infrastructure operations, including architecture, administration, monitoring, optimisation, and scalability, including examples and best practices. Read the interview here (archived).


Apache Drill development workshop, 2013

Organiser

An Apache Drill workshop with Ted Dunning, chief scientist at MapR and long-time contributor to Apache Hadoop, Zookeeper and HBase. Sponsored by the University of Amsterdam.


NL-HUG, Netherlands Hadoop User Group, 2010-2013

Founder & organiser

I was the founder and main organizer of the Dutch Hadoop User Group (NL-HUG). This group grew to over 400 members over the course of 3 years and was the leading forum for Hadoop users from industry and academia in The Netherlands. We had speakers like Russel Jurney (Hortonworks, author of Agile Data Science), Lisa Green (CommonCrawl, director), Jimmy Lin (Professor at University of Waterloo, Twitter, Cloudera), Amr Awadallah (Co-founder Cloudera), Arun Murthy (Co-founder Hortonworks), and many others.


HPC Hadoop course, 2012

Lecturer

Course website. Course at the University of Amsterdam for grad and postgrad students, about the MapReduce computational model and data locality with HDFS, covering both theory and practice.


IIR Big Data Analytics, Hadoop lectures, 2012

Lecturer

Big Data Analytics course centered around large scale data processing with Apache Hadoop.


SIKS Big Data course for PhD students, 2011

Organizer & lecturer

A Hadoop / MapReduce course (summary) for the School for Information and Knowledge Systems (SIKS), with prof. Djoerd Hiemstra, prof. Arjen de Vries and prof. Jimmy Lin.

Abstract: The School for Information and Knowledge Systems SIKS and the Dutch e-science grid BiG Grid organized a new twoday tutorial on Big Data at the University of Twente on 30 November and 1 December 2011, just preceding the DutchBelgian Database Day. The tutorial is on top of some exciting new developments in large-scale data processing and data centers, initiated by Google, and followed by many others such as Yahoo, Amazon, Microsoft, and Facebook. The course teaches how to process terabytes of data on large clusters, and discusses several core computer science topics adapted for big data, such as new file systems (Google File System and Hadoop FS), new programming paradigms (MapReduce), new programming languages and query languages (Sawzall, Pig Latin), and new ‘noSQL’ databases (BigTable, Cassandra and Dynamo).


Several LHC grid tutorials, 2009-2013

Lecturer