eliashornberg

eliashornberg

Highlights

CartPole-A2C-reinforcement-learning CartPole-A2C-reinforcement-learning Public

This repository contains the implementation of the K-workers, n-step Advantage Actor-Critic (A2C) algorithm applied to the CartPole environment, as part of a reinforcement learning project for the …

Jupyter Notebook
EPFLLaMA EPFLLaMA Public

EPFLLaMA: A lightweight language model fine-tuned on EPFL curriculum content. Specialized for STEM education and multiple-choice question answering. Implements advanced techniques like SFT, DPO, an…

Jupyter Notebook