DataJello.com

Anything about data analytics

Author: John

machine learning

Approximate Function Methods in Reinforcement learning

July 26, 2022 | Updated August 3, 2022 | John

Tabular vs Function Methods: In reinforcement learning, there are a few methods that are called tabular methods because they track a table of the (input,

Read More
statistics

Statistical Reasoning of Hypothesis Testing for Beginner

July 22, 2022 | John

For beginners: I would like to give an explanation that helps beginners understand the basis of hypothesis testing, and I have tagged various sections with

Read More
machine learning

Planning with Tabular Methods in Reinforcement Learning

July 21, 2022 | John

Tabular methods: Tabular methods refer to problems in which the state and action spaces are small enough for approximate value functions to be represented as

Read More
machine learning

Temporal Difference Control in Reinforcement Learning

July 20, 2022 | Updated July 30, 2022 | John

Temporal Difference learning is one of the most important ideas in Reinforcement Learning. We should go over the control aspect of TD to find an

Read More
machine learning

Temporal Difference Learning

July 20, 2022 | Updated July 21, 2022 | John

Temporal Difference (TD) learning is the most novel and central idea of reinforcement learning. It combines the advantages of Dynamic Programming and Monte Carlo methods.

Read More
machine learning

Monte Carlo Methods in RL

July 20, 2022 | John

In Reinforcement Learning, Monte Carlo methods are a collection of methods for estimating the value functions and discovering optimal policies through experience – sampling

Read More
machine learning

Solving MDPs with dynamic programming

July 17, 2022 | Updated July 19, 2022 | John

In Reinforcement Learning, one way to solve finite MDPs is to use dynamic programming. Policy Evaluation (of the value functions): It refers to the iterative

Read More
machine learning

Finite Markov Decision Processes (MDP)

July 16, 2022 | Updated July 17, 2022 | John

A finite MDP is the formal problem definition that we try to solve in most reinforcement learning problems. Definition: A finite MDP is a classical

Read More
machine learning

Multi-Armed Bandit Problem

July 16, 2022 | Updated July 19, 2022 | John

Pre-requisite: some understanding of reinforcement learning. If not, you can start from Reinforcement Learning Primer. Goal: Let’s analyze this in the classic Multi-Armed Bandit problem using

Read More
machine learning

Reinforcement Learning Primer

July 15, 2022 | Updated August 30, 2022 | John

Reinforcement learning is going to be “the next big thing” in machine learning after 2022, so let’s understand some basics of how it works. Agent:

Read More
