event

ML6 x De Delftse Bedrijvendagen

Kaat Van Doren
HR Manager

15

February

2022

13:45

Accelerate your career at ML6 @ De Delftse Bedrijvendagen

Location: Technische Universiteit Delft

-------

At ML6 we think it is important to give the opportunity to learn, grow and thrive while working with the best of the best in a dynamic environment. We gladly accelerate you in taking your first steps into the professional world and towards the career of your dreams. We are looking for diverse talent who shares our values and wants to make an impact that matters.

Take a look at the opportunities and apply when interested. Please mention the internship you are interested in.

1. Quantifying the semantic gap

The goal of this project is to estimate the expected effect that using a custom language model will have over an existing one based on known heuristics. The current best approach is to compare the overlap of n-grams from your domain-specific texts to those from the texts that the existing solution was trained on and install some (arbitrary) cut-off (i.e, if the n-gram overlap is under 30%, we explore using a custom language model). However, this method is not very quantitative nor very rigorous.

During this internship, you will:

  • Gain experience leveraging state-of-the-art NLP techniques.
  • Conduct applied research to solve a concrete real-world problem.
  • Let your creative skills loose on a major problem for many companies in a wide range of domains.

2. Anomaly detection in text

The goal of this project is to leverage different NLP techniques to arrive at an algorithm that can accurately highlight anomalous words and/or sentences in a document that you wouldn’t expect to appear in that document. This system would likely also exploit the repetitive nature of certain types of documents (e.g, rental contracts are 90% the same because they need a certain legal structure). If successful, such an algorithm could have very impactful use cases in fields such as the legal domain, insurance companies, etc. The concrete approach would be to develop such a system on legal documents but we are open to suggestions if there is another field that interests you more where it could also have a big impact.

During this internship, you will:

  • Gain experience leveraging state-of-the-art NLP techniques.
  • Conduct applied research to solve a concrete real-world problem.
  • Let your creative skills loose on a major problem for many companies in a wide range of domains.


3. Extractive text summarization

The goal of this project is to develop an algorithm that highlights the most important information in a document. However, how we define important information requires some creativity (i.e, it could mean sentences that summarise the text, sentences that are unexpected, etc.). Concretely, we see a major use case for legal texts where we can also exploit the repetitive nature of such documents (e.g, rental contracts are 90% the same because they need a certain legal structure) but we are open to suggestions if there is another field that interests you more where it could also have a big impact.

During this internship, you will:

  • Gain experience leveraging state-of-the-art NLP techniques.
  • Conduct applied research to solve a concrete real-world problem.
  • Let your creative skills loose on a major problem for many companies in a wide range of domains.



4. Ping Pong AI

For this use-case, we are interested in analysing a table tennis game in real-time. The challenge with table tennis is that it’s a very fast-paced sport, as the players are close to each other and the ball can go quite fast. Another challenge is the size of the ball, which is very small, especially because the camera needs to be quite far away to capture the table and players. At the moment, the TTNet paper by osai.ai seems to be the most promising one for solving these challenges.


During this internship, you will:

  • Extend the basic PyTorch implementation of TTNet to allow for analysing a table tennis game in real-time. Ideally, it would be able to recognise when a point has been made and by who.
  • Deploy your model to a Jetson Xavier and integrate it into the already existing pipeline.
  • Write the analysis results to a database (Firestore).
  • Extend the front-end (React) to show these analysis results (optional).


SPEAKERS

Our experts

guest speakers
RELATED ARTICLES

More insights on