P-DQN: An Unique Algorithm for Discrete-Continuous Hybrid Action Space. Parameterised Deep Q Learning. Published in DataDrivenInvestor, Jan 12, 2022.
Hold Your Horses, My Dear Reinforcement Learning Agent. Stopping RL Agent From Taking Erratic Action Jumps. Published in DataDrivenInvestor, Sep 12, 2022.
(1/2) Same Story (MLE) Different Endings: Mean Square Error, Cross Entropy, KL Divergence. Mathematically Proving That They are All the Same. Published in Analytics Vidhya, Aug 1, 2022.
How Caste System Born and Functions In India — Part — 1. Penning my understanding from the writings of Dr. B.R. Ambedkar. Published in The Enlightened Indian, Jan 30, 2022.
RL vs Optimal Control: LQR for Trajectory Tracking (With Python Code). The Linear Quadratic Regulator. Nov 4, 2021.
Imitate with Caution: Offline and Online Imitation. Behavioural Cloning, Data Aggregation Approach: DAGGER. Published in Analytics Vidhya, Oct 6, 2021.
A Unique Intersection of Game Theory and Data Science: E-Commerce Product Pricing. Price Elasticity, Cross Elasticity and Nash Equilibrium. Published in Nerd For Tech, Jun 18, 2021.
Quick Game Theory Blog Series For Dummies. A 6-blog series, each less than 5 minutes. Published in Nerd For Tech, May 29, 2021.
Game Theory: Nash Equilibrium For Mixed Strategies (Part 6). Continuous Actions and Stochastic Strategic Games. Published in Nerd For Tech, May 29, 2021.