Tech Talks in Data Science, AI and Machine Learning

How do you implement information extraction from images and free text? Can you apply advanced machine learning techniques to real-world scenarios? Can a machine ever learn to identify the base-level in knowledge graphs?

On Tuesday 15th October, in collaboration with Databricks’s Spark+AI Summit and Women in Tech Netherlands, we hosted Tech Talks in Data Science, AI and Machine Learning to discover just this. With help from our Chair, Esther Schagen-van Luit, Data Science, ML and AI enthusiast, as well as being specialist master in security architecture at Deloitte, each talk was warmly received by the 400+ audience.

Clemence Burnichon – Depop

Speaking in front of a full-house at Amsterdam RAI, Clemence Burnichon, lead data scientist at Depop, started the evening with a talk on Understanding the Depop Inventory. Clemence is leading the machine learning effort across Depop, specialising in generating smart capabilities to improve buyer’s and seller’s experience with Depop.

In her talk, Clemence explained that at Depop, their 15 million+ users can list items for sale with up to 4 images and a short description. In order to understand their inventory, Clemence and her team have worked on extracting information from images and free text by developing a machine learning model and deploying there model in production. She walked us through the journey they took to create such capabilities from solution design to deployment in production.

Adi Polak – Microsoft

End-to-end data science solutions are a hard task. Applying advanced ML approaches to real-world scenarios requires us to clean, prepare and feature engineer the data before we can even start discussing the algorithms. But this is not all, we need to test various ML models, try different configurations and techniques and last but not least, productionize the models.

Doing so while dealing with massive amounts of data is not exactly an ‘easy walk in the park’.

In our second talk, Adi Polak, a senior software engineer and a developer advocate at Microsoft working on Azure, demonstrated basics of the ML development process. She showed the audience how to perform feature engineering with Apache Spark, what ML pipelines are, and the various options for training and managing ML models with Apache Spark.

In her role at Microsoft, Adi, focuses on microservices architecture, distributed systems, real-time processing, big data analysis, machine learning at scale and functional programming. As a developer advocate, Adi brings her vast experience in tech and helps teams to design, architect and build their software and infrastructure with cost-effective, scalability, team knowledge and business market in mind.

Laura Hollink – CWI

To close our Meetup, Laura Hollink gave her talk on How to Make Knowledge Graphs Work for Humans. Laura is a tenure track researcher at CWI with an interest in both knowledge representation and human computer interaction.

Laura explained how she enriches knowledge graphs by predicting which concepts are “basic level concepts”. The basic level, according to experiments in cognitive psychology, is the level of abstraction in a hierarchy of concepts at which humans perform tasks quicker and with greater accuracy than at other levels.

Laura developed a method to machine learn the basic level from data. A range of experiments, performed on WordNet, showed that basic level prediction works well with a representative training set. To predict the basic level in a new domain, a domain-normalisation step is necessary. Concepts that are difficult to label for humans seem to also be harder to classify automatically. Future plans are to apply, improve and test the method on a large scale and on a wide range of knowledge graphs. All this is required to help machines better anticipate human behaviour.


Did you miss this Meetup? Recordings of these presentations will be available soon.

Read More

  • CWI spin-off DuckDB Labs partners with Motherduck, which raises $47.5 million

    CWI spin-off company DuckDB Labs helped create startup MotherDuck which aims to connect DuckDB to the cloud. MotherDuck sports some big names: its CEO is Jordan Tigani, founding engineer at Google’s BigQuery, Google’s fully managed data analysis platform. A big part of the $47.5 million funding comes from Andreessen Horowitz, a prominent venture capital firm, specialized in technology startups.

  • Building an international research community: HHAI Conference 2023

    Breaking ground as the first international conference on Hybrid Human Artificial Intelligence, HHAI22 held its first-ever in-person meeting in Amsterdam in the summer of 2022, establishing the beginnings of an international research community. ADS contributed to the conference by hosting a Meetup around the topic of Hybrid Intelligence.

  • Inaugural Speech Nanda Piersma

    There are legal rules and ethical frameworks, but little or no practical guidance on responsible design. In her inaugural lecture “System error, please restart”, Nanda Piersma argues what ‘responsible’ means and how we can carry out the (further) development of IT systems in such a way that they earn our trust.