Pinned Loading
-
CartPole-A2C-reinforcement-learning
CartPole-A2C-reinforcement-learning PublicThis repository contains the implementation of the K-workers, n-step Advantage Actor-Critic (A2C) algorithm applied to the CartPole environment, as part of a reinforcement learning project for the …
Jupyter Notebook
-
EPFLLaMA
EPFLLaMA PublicEPFLLaMA: A lightweight language model fine-tuned on EPFL curriculum content. Specialized for STEM education and multiple-choice question answering. Implements advanced techniques like SFT, DPO, an…
Jupyter Notebook
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.