[Pinned] Kowshik chilamkurthy in DataDrivenInvestor | P-DQN: An Unique Algorithm for Discrete-Continuous Hybrid Action Space | Parameterised Deep Q Learning | Jan 12, 2022
Kowshik chilamkurthy | Reinforcement Learning for Optimal Station Policy | Problem Statement | Oct 24, 2022
Kowshik chilamkurthy in DataDrivenInvestor | Hold Your Horses, My Dear Reinforcement Learning Agent | Stopping RL Agent From Taking Erratic Action Jumps | Sep 12, 2022
Kowshik chilamkurthy in Analytics Vidhya | (1/2) Same Story (MLE) Different Endings: Mean Square Error, Cross Entropy, KL Divergence | Mathematically Proving That They are All the Same | Aug 1, 2022
Kowshik chilamkurthy in The Enlightened Indian | How Caste System Born and Functions In India — Part — 1 | Penning my understanding from the writings of Dr. B.R. Ambedkar | Jan 30, 2022
Kowshik chilamkurthy | RL vs Optimal Control: LQR for Trajectory Tracking (With Python Code) | The Linear Quadratic Regulator | Nov 4, 2021
Kowshik chilamkurthy in Analytics Vidhya | Imitate with Caution: Offline and Online Imitation | Behavioural Cloning, Data Aggregation Approach: DAGGER | Oct 6, 2021
Kowshik chilamkurthy in Nerd For Tech | A Unique Intersection of Game Theory and Data Science: E-Commerce Product Pricing | Price Elasticity, Cross Elasticity and Nash Equilibrium | Jun 18, 2021
Kowshik chilamkurthy in Nerd For Tech | Quick Game Theory Blog Series For Dummies | 6-blog series, each less than 5 minutes | May 29, 2021
Kowshik chilamkurthy in Nerd For Tech | Game Theory: Nash Equilibrium For Mixed Strategies (Part 6) | Continuous actions and Stochastic Strategic Games | May 29, 2021