PinnedKowshik chilamkurthyinDataDrivenInvestorP-DQN: An Unique Algorithm for Discrete-Continuous Hybrid Action SpaceParameterised Deep Q LearningJan 12, 20221Jan 12, 20221

Kowshik chilamkurthyReinforcement Learning for Optimal Station PolicyProblem StatementOct 24, 2022Oct 24, 2022

Kowshik chilamkurthyinDataDrivenInvestorHold Your Horses, My Dear Reinforcement Learning AgentStopping RL Agent From Taking Erratic Action JumpsSep 12, 20221Sep 12, 20221

Kowshik chilamkurthyinAnalytics Vidhya(1/2) Same Story(MLE) Different Endings: Mean Square Error, Cross Entropy, KL DivergenceMathematically Proving That They are All the SameAug 1, 20221Aug 1, 20221

Kowshik chilamkurthyinThe Enlightened IndianHow Caste System Born and Functions In India — Part — 1Penning my understanding from the writings of Dr.B.R AmbedkarJan 30, 2022Jan 30, 2022

Kowshik chilamkurthyRL vs Optimal Control: LQR for Trajectory Tracking (With Python Code)The Linear Quadratic RegulatorNov 4, 20211Nov 4, 20211

Kowshik chilamkurthyinAnalytics VidhyaImitate with Caution: Offline and Online ImitationBehavioural Cloning, Data Aggregation Approach: DAGGER.Oct 6, 2021Oct 6, 2021

Kowshik chilamkurthyinNerd For TechA Unique Intersection of Game Theory and Data Science: E-Commerce Product PricingPrice Elasticity, Cross Elasticity and Nash EquilibriumJun 18, 2021Jun 18, 2021

Kowshik chilamkurthyinNerd For TechQuick Game Theory Blog Series For Dummies6 -blog series Each less than 5 MinutesMay 29, 2021May 29, 2021

Kowshik chilamkurthyinNerd For TechGame Theory: Nash Equilibrium For Mixed Strategies ( Part 6 )Continuous actions and Stochastic Strategic GamesMay 29, 2021May 29, 2021