PinnedPublished inDataDrivenInvestorP-DQN: An Unique Algorithm for Discrete-Continuous Hybrid Action SpaceParameterised Deep Q LearningJan 12, 2022A response icon1Jan 12, 2022A response icon1
Published inDataDrivenInvestorHold Your Horses, My Dear Reinforcement Learning AgentStopping RL Agent From Taking Erratic Action JumpsSep 12, 2022A response icon1Sep 12, 2022A response icon1
Published inAnalytics Vidhya(1/2) Same Story(MLE) Different Endings: Mean Square Error, Cross Entropy, KL DivergenceMathematically Proving That They are All the SameAug 1, 2022A response icon1Aug 1, 2022A response icon1
Published inThe Enlightened IndianHow Caste System Born and Functions In India — Part — 1Penning my understanding from the writings of Dr.B.R AmbedkarJan 30, 2022Jan 30, 2022
RL vs Optimal Control: LQR for Trajectory Tracking (With Python Code)The Linear Quadratic RegulatorNov 4, 2021A response icon1Nov 4, 2021A response icon1
Published inAnalytics VidhyaImitate with Caution: Offline and Online ImitationBehavioural Cloning, Data Aggregation Approach: DAGGER.Oct 6, 2021Oct 6, 2021
Published inNerd For TechA Unique Intersection of Game Theory and Data Science: E-Commerce Product PricingPrice Elasticity, Cross Elasticity and Nash EquilibriumJun 18, 2021Jun 18, 2021
Published inNerd For TechQuick Game Theory Blog Series For Dummies6 -blog series Each less than 5 MinutesMay 29, 2021May 29, 2021
Published inNerd For TechGame Theory: Nash Equilibrium For Mixed Strategies ( Part 6 )Continuous actions and Stochastic Strategic GamesMay 29, 2021May 29, 2021