  • Student at University of Kassel
  • Kassel

KlaraGtknst/README.md

Hi there, I'm Klara M. Gutekunst 👋

I'm currently pursuing my Master's degree in Computer Science at the University of Kassel. My interests lie in machine learning, data analysis, information retrieval, and natural language processing. Feel free to explore my repositories to see the projects I've been working on.

🔬 Research and Projects

Here are some of the projects I've been involved in:

  • Bachelor Thesis: Identification of Key Information with Topic Analysis on Large Unstructured Text Data
    Repository: bachelor-thesis
    Description: This repository contains the written work for my Bachelor thesis, which focuses on identifying key information using topic analysis techniques.

  • Discord Detection in Time Series Data
    Repository: discord_detection
    Description: A project to identify discords in time series data using the HOT SAX methodology.

  • Approaches for Finding Sample Pairs in Contrastive Learning
    Repository: master-seminar-ies
    Description: Work from my Master seminar focusing on methods to find sample pairs in contrastive learning, conducted under the Intelligent Embedded Systems chair.

  • Text topic
    Repository: text_topic
    Description: This repository implements a pipeline that stores various fields for the files of a large unstructured dataset. These fields are used for topic modeling (word clouds, based on low-dimensional versions of embedding vectors, named entity clustering, and document-topic incidences). The information is aggregated and visualised using Formal Concept Analysis (FCA).

  • Topic Analysis of Text Data
    Repository: topic-analysis-text-data
    Description: This repository provides methods and functions to find similar documents in terms of content and visual appearance, i.e. layout, from a large corpus of unstructured text data.

  • Identifying Fiscal Fraud with Anomaly Detection Techniques
    Repository: identifying-fiscal-fraud
    Description: Work from my Bachelor seminar exploring techniques to identify anomalies and fiscal fraud.

🚀 About Me

  • 🔭 I’m currently working on data mining of large unstructured (text) data and on Information Retrieval research projects.
  • 🌱 I’m currently learning about argumentative search and web search in the context of Information Retrieval.
  • 👯 I’m looking to collaborate on Natural Language Processing and Information Retrieval projects, as well as open-source initiatives.
  • 💬 Ask me about Natural Language Processing, Information Retrieval and Data Mining.
  • 😄 Pronouns: She/Her
  • ⚑ Fun fact: I enjoy visualizing complex data through creative infographics!

📊 GitHub Stats

Klara's GitHub stats

📫 Connect with Me

Feel free to reach out if you're interested in collaborating or discussing any of my projects!

Popular repositories

  1. I2OT_energy Public

    I2OT Hackathon

    Jupyter Notebook 3 2

  2. bachelor-thesis Public

    This repository contains the written work on the Bachelor thesis 'Identification of key information with topic analysis on large unstructured text data'.

    TeX 2

  3. identifying-fiscal-fraud Public

    Seminar about exploring techniques to identify anomalies and fiscal fraud.

    TeX 2

  4. e2ml_SoSe23 Public

    Repository for the course Experimentation and Evaluation in Machine Learning (E2ML), a Master's degree course at the University of Kassel in the summer semester 2023 (SoSe23).

    Jupyter Notebook

  5. discord_detection Public

    This repository aims to identify discords in time series data, using the HOT SAX publication as a reference. The base code is the result of work by Dr. Christian Gruhl, while alterations to add the…

    Python

  6. topic-analysis-text-data Public

    This repo provides methods and functions to find similar documents in terms of content and visual appearance, i.e. layout, from a large corpus of unstructured text data.

    Jupyter Notebook