DataJello.com - Page 3 of 7 - Anything about data analytics

Solving MDPs with dynamic programming

July 17, 2022July 19, 2022 John

In Reinforcement Learning, one way to solve finite MDPs is to use dynamic programming. Policy Evaluation (of the value functions) It refers to the iterative

machine learning

Finite Markov Decision Processes (MDP)

July 16, 2022July 17, 2022 John

Finite MDP is the formal problem definition that we try to solve in most of the reinforcement learning problem. Definition Finite MDP is a classical

machine learning

Multi-Armed Bandit Problem

July 16, 2022July 19, 2022 John

Pre-requisite: some understanding of reinforcement learning. If not, you can start from Reinforcement Learning Primer Goal Let’s analyze this in the classic Multi-Armed Bandit problem using

machine learning

Reinforcement Learning Primer

July 15, 2022August 30, 2022 John

Reinforcement learning is going to be “the next big thing” in machine learning after 2022, so let’s understand some basic on how it works. Agent:

machine learning

How Transformer Positional Encoding Works

July 14, 2022July 14, 2022 John

The positional encoding of transformer was a detail added in Attention Is All You Need. When I first saw this, I thought “why is the

machine learning

KL (Kullback–Leibler) Divergence and JS (Jensen-Shanon) Divergence

July 8, 2022July 9, 2022 John

The KL (Kullback–Leibler) Divergence and JS (Jensen-Shanon) Divergence are ways to measure the distance (similarity) between two distributions P and Q. I will try to

machine learning

Transformer NLP Tutorial in 2022: Finetune BERT on Amazon Review

June 29, 2022July 5, 2022 John

Background In 2022, if you are not new to NLP (Natural Language Processing), you should have heard of BERT (Bidirectional Encoder Representations from Transforms). It’s

statistics

Understanding the Chi-squared tests

June 23, 2022June 29, 2022 John

What are Chi-squared tests for? Compare an expected (hypothesized) categorical distribution vs an observed (sampled) categorical distribution Note that the distribution must be categorical (ie.

machine learning