This playlab encompasses a multitude of projects crafted through the utilization of LLM Models with the use of RLHF, showcasing the versatility and impact of these models across various applications.
Project Title | Substack Link | Github Link |
---|---|---|
Revolutionizing Content Generation: The Dynamic Duo of RLHF Unleashed - PPO and PEFT Fine-Tuned LLMs -PART I | Link | link |
Revolutionizing Content Generation: The Dynamic Duo of RLHF Unleashed - PPO and PEFT Fine-Tuned LLMs -PART II | link | link |
Revolutionizing Content Generation: The Dynamic Duo of RLHF Unleashed - PPO and PEFT Fine-Tuned LLMs-PART III | link | link |