A cheat sheet for statistical estimators

What distribution and parameters fit our data? What is the confidence interval of the estimated parameters?

The above article contains the formulas with minimal explanations. If you have seen this topic some time ago, it is probably a good way to refresh memories. However, this would be a bit too harsh to start from scratch with it. …

An introduction to classification algorithms

In the precedent article, I spoke about the creation of a Bag of Words in R. Then, I made tests on two different datasets. This is a good occasion to tell a little about some commonly used classification algorithms.

I will give some code samples in R, but this is…

How to create a Bag of Words embedding in R?

Bag of Word embedding is a Natural Language Processing technic to embed sentences into a fixed-size numeric vector. The goal is to use this vector as an input for a machine learning algorithm.

Bag of Words is simple to understand an is a great technic when you want to keep…

A glimpse of the maths behind Optimization Algorithms

Ok, let’s start…

(∩｀-´)⊃━☆ﾟ.*･｡ﾟ

import numpy as np
from scipy.optimize import minimize

End…

No? Ok… ¯\_(ツ)_/¯ We need to dive a bit more into maths…

A cheatsheet of mathematical formulas for fundations of Continuous Optimization

A big part of the above formulas are from my notes at the DSTI. I also added curves I found or created, as well as more gradient descent algorithms. Please note that it is a formulas cheat sheet, not a course. It is good to check or refresh your knowledge.

Common special characters and Unicode symbols to copy-paste for mathematics…

Because it is sometime useful…

Super/sub script:

• Numerical exponents: ⁰ ¹ ² ³ ⁴ ⁵ ⁶ ⁷ ⁸ ⁹
• Numerical indices: ₀ ₁ ₂ ₃ ₄ ₅ ₆ ₇ ₈ ₉
• Superscript : ᵃ ᵇ ᶜ ᵈ ᵉ ᶠ ᵍ ʰ ⁱ ʲ ᵏ ˡ ᵐ ⁿ ᵒ ᵖ ʳ…

Descriptive statistics and probability formulas

This is a cheat-sheet for descriptive statistics and probability with some R. It is for a big part from my notes of the DSTI courses, while some concepts are from other courses. It starts with the very basics and will cover more advanced features over time.

On time to time…

General calculus formulas

This one is a cheat-sheet for pretty general formulas of calculus such as derivatives, integrales, trigonometry, complex numbers… Something you may find useful in many contexts. It is also a good way to check what you remember years after school… ¯\_(ツ)_/¯

How to create, and explain, ugly density histograms with R…

x <- rgamma(1000,1)
hist(x, c(0,1,3,10))

The above histogram is an alternative way to plot the probability density function of the gamma distribution, on a sample of 1000 items.

Easy to plot, thanks to the “hist” function, right ? But how does it work ?

Here is the position of each… Thibaut

Publications in English & French about Data Science, Artificial Intelligence, and Innovation.