Project

This is a list of my personal projects.

Code, Machine Learning, Project, Videos, YouTube

Why Your RL Model Fails: Prioritized Replay and Actor-Critic in code (Part 2)

In this video, I break down the code behind designing an RL agent with an Actor-Critic architecture using a prioritized replay buffer! 🤖💻 Discover how to tackle sparse rewards, optimize training efficiency, and boost your model’s performance with practical tips and WandB tracking. If you want to go beyond theory and see how to implement these concepts in code, this is the video for you! Check it out and level up your RL skills today!

Academic, Code, Concept, Machine Learning, Project, Videos

Score Prediction from User Log with BERT

Discover our latest project: predicting user scores from their interaction logs with a quiz system! By analyzing user behavior with ML techniques such as DistilBERT, curriculum learning, and LoRA, we built a model that infers scores without access to the correct answers or the users' choices. Dive into our approach and see how it can be applied to educational tools. #MachineLearning #AI #EdTech #Robotics #DistilBERT #CurriculumLearning #LoRA
