Experience

Big Data Engineer
TU Vienna January 2021 – January 2023
Responsibilities include:
- managing and tuning a Big Data cluster
- holding courses and workshops on Big Data
HR Technology Working Student
SAP November 2018 – August 2020
Responsibilities include:
- managing innovative IT-projects such as Chatbots based on Machine Learning

Education

MSc Data Science
TU Vienna October 2021 – December 2024
Thesis is WIP.
Courses included:
- Advanced Methods for Regression and Classification
- Experiment Design for Data Science
- Modeling and Simulation
- Statistical Computing
MSc Artificial Intelligence (Erasmus)
KU Leuven February 2023 – September 2023
Courses included:
- Brain Computer Interfaces
- Image Analysis and Understanding
- Analysis of Large Scale Social Networks
- Information Retrieval and Search Engines
BSc Computer Science
University of Vienna October 2016 – December 2020
Thesis: “Evaluation of the multilingual Semantic Text Similarity” Abstract: Determining the semantic relatedness between texts, such as phrases and sentences, has become one of the most crucial tasks within the field of Natural Language Processing. To compute this semantic relatedness or Semantic Textual Similarity (STS), a given text has to be first transformed into a numerical representation, which can be achieved through various mathematical concepts, also called word embeddings. This thesis addresses the question of which word embeddings or, more generally, architectures determine STS most accurately. Some of the currently most notable models are the frequency-based Bag of Words (BoW) and the prediction-based Long Short-Term Memory (LSTM) and Bidirectional Encoder Representations from Transformers (BERT), each combined with a distance function, such as Cosine Similarity. In the scope of this thesis these STS models among others are investigated, implemented and trained on multilingual data, in English and German. It is established that prediction-based word embeddings result in more accuracy than frequency-based representations for English data sets. It is also discovered that the evaluation of the models highly depends on STS annotations in the data set, which are not provided in available German corpora.

Skills & Hobbies

Technical Skills

Python

Data Science

SQL

Hobbies

Climbing

Reading

Art

Chess

Languages

100%

German

90%

English

60%

Armenian

10%

Spanish

Experience

Big Data Engineer

HR Technology Working Student

Education

MSc Data Science

MSc Artificial Intelligence (Erasmus)

BSc Computer Science