Skip to content

Assignments for the interviews of our AI team

License

Notifications You must be signed in to change notification settings

xtreamsrl/xtream-ai-assignment-intern

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

xtream AI Challenge

Ready Player 1? 🚀

Hey there! If you're reading this, you've already aced our first screening. Awesome job! 👏👏👏

Welcome to the next level of your journey towards the xtream AI squad. Here's your cool new assignment.

Among the datasets described below, pick just one that catches your eye. Each dataset comes with its own set of challenges. Don't stress about doing them all. Just dive into the ones that spark your interest or that you feel confident about. Let your talents shine bright! ✨

Take your time – you've got 10 days to show us your magic, starting from when you get this. No rush, work at your pace. If you need more time, just let us know. We're here to help you succeed. 🤝

What You Need to Do

Think of this as a real-world project. Fork this repo and treat it as if you're working on something big! When the deadline hits, we'll be excited to check out your work. No need to tell us you're done – we'll know. 😎

🚨 Heads Up: You might think the tasks are a bit open-ended or the instructions aren't super detailed. That’s intentional! We want to see how you creatively make the most out of the data and craft your own effective solutions.

🚨 Remember: At the end of this doc, there's a "How to run" section left blank just for you. Please fill it in with instructions on how to run your code – it's important!

How We'll Evaluate Your Work

We'll be looking at a bunch of things to see how awesome your work is, like:

  • Your workflow
  • How well you understand the problem and data
  • The approach to the analysis and clarity of your conclusions
  • How neat your code is (relative to your experience level)

🚨 Keep This in Mind: This isn't about building the fanciest model: we're more interested in your process and thinking.

This is your chance to showcase your unique approach and thought process. Don't worry if your code isn't perfect or your model isn't top-notch yet. We've been in your shoes and are here to help you grow. 🌟


Diamonds

Problem type: Regression

Dataset description: Diamonds Readme

Meet Don Francesco, the mystery-shrouded, fabulously wealthy owner of a jewelry empire.

He's got an impressive collection of 5000 diamonds and a temperament to match - so let's keep him smiling, shall we? In our dataset, you'll find all the glittery details of these gems, from size to sparkle, along with their values appraised by an expert. You can assume that the expert's valuations are in line with the real market value of the stones.

Challenge 1

Francesco wonders: what makes a diamond valuable? You should provide him with an answer.

Don Francesco has been very clear with you: he is not a fan of tech jargon, so keep your message plain and simple. However, he trusts no one - certainly not you. He's hired Luca, another data scientist, to double-check your findings (no pressure!). Your mission is simple.

Create a Jupyter notebook to explain what Francesco should look at and why. Your code should be understandable by a data scientist like Luca, but your text and visualizations should be clear for a layman like Francesco.

Challenge 2

Plot twist! The expert who priced these gems has now vanished. Francesco needs you to be the new diamond evaluator. He's looking for a model that predicts a gem's worth based on its characteristics. And, because Francesco's clientele is as demanding as he is, he wants the why behind every price tag.

Create another Jupyter notebook where you develop and evaluate your model.

So, ready to add some sparkle to this challenge? Let's make these diamonds shine! 🌟💎✨


Italian Power Load

Problem type: time series forecasting

Dataset description: Power Load readme

Welcome to your first day at the office, and what a charged-up project you have! Your client is Zap Inc, a fictional power player in Italy. They're handing you data on Italy's power load from 2006 to 2022. Marta, your wise colleague, has a piece of advice for you: be careful with 2020, it was a very peculiar year.

Challenge 1

Zap Inc asks you for a complete report about the main feature of the power load series. The report should be understandable by a layman, but it should also provide enough details to be useful for a data scientist. Create a Jupyter Notebook that sheds light on the main characteristics of the power load data. Make it clear, make it insightful!

Challenge 2

Now, it's time to predict the future, well, at least a year into it. Develop a long-term forecasting model for the power load with a forecasting horizon of one year ahead, but let's skip 2020-2022 and use 2019 as your testing ground. Marta's advice: Zap's bosses aren't AI gurus, so your model's accuracy and explainability need to be crystal clear. Illuminate these points with a second notebook.

Ready to electrify your career with this project? Let's power through! ⚡🔌📈


Employee Churn

Problem type: classification

Dataset description: Employee churn readme

Your first client is Pear Inc, a multinational company worried about its poor talent retention. Pear has a peculiar hiring strategy. They offer free classes and hire the best students. The strategy is working, but many new hires leave the company after a few months. This is a huge waste of time and money.

In the past few months, they collected a dataset with information about their employees and recorded whether they churned or not. Due to the churning period being so short, they are confident that the history of each candidate is enough to predict the churn and the experience is Pear is not relevant.

Gabriele, the Head of Talent, is counting on you to put a plug in this problem. And Fabio, an ML Engineer and new colleague of yours, will be eyeing your work too!

Challenge 1

Pear Inc needs your help to figure out what makes an employee stick versus split. Craft a Jupyter notebook to answer their question. Keep it simple enough for Gabriele but detailed enough to dazzle Fabio.

Challenge 2

Next up, Pear Inc needs a crystal ball to foresee who will staty with them. Develop a model that predicts employee churn, complete with churn probabilities to help Gabriele take timely action. Remember, Gabriele's no fan of black-box machines, and Fabio's looking for proof that your model's magic works.

So, ready to tackle Pear Inc's perplexing problem and turn the tide on talent turnover? Let's get cracking! 🍐💼🔍


How to run

Please fill this section as part of the assignment.