← Back to all posts

#10: 5 Practical Tips for Time Series Analysis | 3 Tips for your ML Income Growth

by Timur Bikmukhametov
Apr 17, 2025
Reading time - 7 mins

 

1. ML Picks of the Week

A weekly dose of ML tools, concepts & interview prep. All in under 1 minute.

 

2. Technical ML Section

Learn 5 Practical Tips for Time Series Analysis from my experience

 

3. Career ML Section

Learn 3 tips that can help you grow your ML career income


1. ML Picks of the Week

🥇ML Tool
Python Library StatsForecast

StatsForecast is a computationally efficient library for time series forecasting, built for scale and performance.

It offers state-of-the-art implementations of classic statistical models like ARIMA, ETS, and AutoETS — all written in Numba for speed.

StatsForecast is especially powerful for:

✅ Running thousands of forecasts in parallel (yes, really fast)
✅ Getting accurate baselines for production models
✅ Automating time series pipelines with minimal code

If you’re still waiting minutes for ARIMA to run in loops, switch to StatsForecast.


📈 ML Concept
Gini Impurity

The Gini impurity is a metric used in decision trees to decide how good a split is.

It measures how mixed the classes are in a node — the lower the Gini value, the purer the node (i.e., more of the same class).

It’s fast to compute and works especially well for classification tasks.

Gini is the default split criterion in many algorithms like CART and Random Forest.

Gini is especially useful for:

✅ Building fast and interpretable classification trees
✅ Selecting the most informative features
✅ Creating consistent and efficient splits in large datasets

Read a more detailed explanation of Gini Impurity HERE.


🤔 ML Interview Question
What is one-hot encoding?

One-hot encoding is a common way to convert categorical variables into a format that machine learning models can work with.

It creates a new binary column for each unique category, assigning 1 to the matching category and 0 to the rest.

Example:
Color = Red → [Red: 1, Green: 0, Blue: 0]

Why is this important?

→ Most ML models (like linear models or neural nets) can’t work directly with text.
→ One-hot ensures that the model doesn’t assume any order or ranking between categories.
→ It’s simple, reliable, and widely used in preprocessing pipelines.

Read a more detailed intro to one-hot encoding HERE.


2. Technical ML Section

5 Practical Tips for Time Series Analysis

 

Analyzing time series might seem easy. But in real-world cases, it's not because of many reasons, like:

  • Nan values
  • Outliers
  • Noise
  • Inconsistent timestamps
  • Long, hard-to-analyze sequences

 

In this newsletter, you'll learn 5 practical tips I learned over 7 years in time series analysis. They'll help you to extract more value from data in your projects.

 


 

1️⃣ Before computing correlation, remove outliers

 

This tip is especially important when you are selecting features for your model or trying to understand feature relationships.

Imagine you have 2 time series, both having outliers as in the figure below. It can be 2 features or 1 target and 1 feature.

Time Series with outliers

 

You see that there are some outliers, but they might not be significant for your analysis, because compared to the entire sequence, their fraction is 1-2%.

If you compute the Pearson correlation between these series, you will get 0.32.

However, if you first remove them, as shown in the figure below, your correlation will reach 0.42. Big difference for practical use cases.

Time series without outliers

 


 

2️⃣ Apply resample (groupby) over a bigger time horizon than the original sampling frequency

 

This usually helps to remove noise and see the general relationships much better, including correlation.

In pandas, you can do this with "resample" method in Pandas.

 

Another alternative is "groupby".

 

Using resampling for our dataset significantly improves observability of the relationship as well as the correlation value (from 0.32 to 0.65).

Time series pandas groupby

 


 

3️⃣ Don’t use scatter, use regression and contour (kdeplot in seaborn) plots

 

This tip saved me a lot of time from the troubles.

Pro tip: combine regression and contour plots with resampled data from Tip 2.

Here is the original dataset with scatter, regression, and contour plots.

Scatter and regression plots

 

Here are the same plots on the grouped data. Now, compare the regression and contour plot below with the scatter plot above.

Way easier to see the relationships.

Regression plot for time series

 


 

4️⃣ Use Median filter to see the relationships through noise

 

Often, when you visualize the raw data, it's really hard to see if the data goes down/up/straight due to high noise.

Median filter is a very quick and easy way to see the time series trends and get an overall view of the time series behavior.

You can see how filtering the noise helps us to see the trends and cyclic behavior of our time series below.

Median Filter for time series data

 

To apply a median filter, you can use pandas rolling.

 


 

5️⃣ Use Plotly to scroll and zoom in through the time series

 

Plotly library makes it much quicker to go through the data and find insights rather than zooming and scrolling through changing the plotted indices in matplotlilb.

It does not mean that you need to use it for your entire analysis, but for trends and oscillations analysis over a time period, Plotly is very handy.

You can see how easy it is from the figure below.

Plotly for time series visualization

 

That's it for the Technical Part! 

Follow me on LinkedIn​ for more daily ML breakdowns.

Let's dive into the Career part!

 


 

2. ML Career Section

3 tips for your ML income growth

 

These tips are based on 7+ years in ML and coaching 70+ people.

They work — and they work especially well if you’re in your first one to six years in a Machine Learning career

The 3 tips are:

  1. Learn how to lead ML teams

  2. Specialize in 1–3 ML or business domains

  3. Be present online and network actively


Why do these actions help?

→ Leadership = Higher pay
You don’t need a manager title.

But you do need to take ownership, lead small projects, and drive outcomes.

People who lead, earn more.

 

→ Domain depth = Higher value
Trying to learn “everything ML” leads nowhere.

Knowing 1–2 domains deeply makes you stand out.

Companies pay more for focused expertise.

 

→ Visibility = More opportunities
If no one knows what you do, no one can hire you.

Posting online, sharing projects, and connecting with others bring jobs, freelance gigs, and credibility.

This is the rule of 2025: "If you don't exist online, you don't exist".

 

Hope this helps in your ML Career Journey!


Useful articles

  1. LASSO regression vs Ridge regression - when to use what? Why is Data
  2. Drift in ML is important to monitor

 


That is it for this week!

If you haven’t yet, follow me on LinkedIn where I share Technical and Career ML content every day!


Whenever you're ready, there are 3 ways I can help you:

​

1. ML Career 1:1 Session​

I’ll address your personal request & create a strategic plan with the next steps to grow your ML career.

 

​2. Full CV & LinkedIn Upgrade (all done for you)​

I review your experience, clarify all the details, and create:
- Upgraded ready-to-use CV (Doc format)
- Optimized LinkedIn Profile (About, Headline, Banner and Experience Sections)

 

​3. CV Review Session​

I review and show major drawbacks of your CV & provide concrete examples on how to fix them. I also give you a ready CV template to make you stand out.


Join Maistermind for 1 weekly piece with 2 ML guides:


1. Technical ML tutorial or skill learning guide
2. Tips list to grow ML career, LinkedIn, income

 

Join here! 

 

 
 
#17: What is Model Registry in ML?
Reading time - 7 mins   🥇 Picks of the Week One line data overview tool, differences in boosting algos, weekly quiz, and more. 🧠 ML Section Structure your knowledge about Model Registry and why you need to use one. Land your next ML job. Fast. ​I built the ML Job Landing Kit to help Data Professionals land jobs faster! ✅ Here’s what’s inside: - 100+ ML Interview Q & A - 25-page CV Crafting...
#16: How to tune LSTM models
  Reading time - 7 mins   🥇 Picks of the Week Best Python dashboard tool, clustering concepts, weekly quiz, and more. 🧠 ML Section Learn practical tips on how to tune LSTM Neural Networks Land your next ML job. Fast. ​I built the ML Job Landing Kit to help Data Professionals land jobs faster! ✅ Here’s what’s inside: - 100+ ML Interview Q & A - 25-page CV Crafting Guide for ML - 10-page Li...
#15: Deep Learning and Transformers - the roadmap | Visibility > technical skills - why?
  Reading time - 7 mins   🥇 Picks of the Week New best LLM for coding, baseline library for anomaly detection, and more.   🧠 ML Section Deep Learning & Transformers - the roadmap   💰 ML Section Why visibility > technical skills in getting ML jobs Land your next ML job. Fast. ​I built the ML Job Landing Kit to help Data Professionals land jobs faster! ✅ Here’s what’s inside: - 100+ ML Interv...
Powered by Kajabi