🌌 India High School Exoplanet Data Challenge β€” Uncover New Worlds with AI

How do astronomers find planets light-years away?

They don't see them directly. They watch starlight β€” and look for the faint, telltale dips that betray a planet passing in front of its star.

 

Now, it's your turn.

 

We invite high school students across India to step into the role of an astrophysicist and data scientist. Using a curated dataset sourced directly from the NASA Exoplanet Archive, your mission is to build a machine learning classification model that separates real exoplanet candidates from noise and false signals β€” the exact kind of problem professionals at NASA and research institutions tackle every day.

 

πŸš€ The Mission

You'll receive a starter CSV file with real observational data. From there, it's on you:

πŸ” Explore β€” understand distributions, correlations, and anomalies

🧹 Clean β€” handle missing values and class imbalance thoughtfully

πŸ€– Model β€” apply classification algorithms of your choice (Random Forest, XGBoost, Neural Networks β€” your call)

πŸ“Š Interpret β€” explain what your model learned and why it matters

 

No astrophysics background required. Just curiosity, code, and commitment :)

 

 

🎯 Why This Challenge Exists

Most student competitions give you toy datasets and synthetic problems. This one gives you real data from a real space telescope.

We designed this challenge to bridge the gap between classroom learning and genuine scientific work β€” and to give students a project that belongs on a college application, a research portfolio, or a GitHub that actually stands out.

This is applied AI for space science, and you don't have to wait until university to do it.

 

⏱️ Duration: 1.5 months of self-paced, hands-on work.

 

πŸ† Prizes & Recognition

πŸ₯‡ Top Solutions Published β€” Winning models and write-ups featured on our platform's newsletter and shared with the STEM community

πŸ“œ Certificates of Excellence β€” Awarded to all top-scoring participants

πŸŽ“ Academic Credibility β€” A real, portfolio-ready project backed by NASA data β€” the kind that stands out to colleges and research programs

🀩Something exciting β€” To be soon announced!!

 

πŸ‘₯ Eligibility

πŸ“ Open to high school students (Grades 9–12) across India

πŸ‘€ Solo participation or teams of up to 3 members

πŸ†“ Free to enter

Requirements

πŸ“‹ Submission Requirements

To submit to the India High School Exoplanet Data Challenge, each team must provide all of the following:

 

βœ… Required Deliverables

 

1. πŸ’» Code / Notebook

  • A clean, well-commented Jupyter Notebook (.ipynb) or Python script (.py)
  • Must be reproducible β€” someone else should be able to run it top to bottom without errors
  • Upload to GitHub and share the public repository link on your Devpost submission

 

2. πŸ“Š Model Results

  • A summary table of your model's performance metrics: Accuracy, Precision, Recall, F1-Score
  • Confusion matrix (image or inline in notebook)
  • Any visualisations supporting your findings (feature importance plots, ROC curves, etc.)

 

3. πŸ“ Written Summary / Report

A written explanation (500–1000 words) covering:

    β€” Your approach to EDA and data cleaning

    β€” Why you chose your model(s)

    β€” Key findings and what the model learned

    β€” How you would explain your predictions to a non-technical audience

Can be included inside the notebook or as a separate PDF

 

4. πŸ—‚οΈ Devpost Project Page

  • Fill in your project title, description, and team members on Devpost
  • Embed or link your GitHub repository
  • Upload at least one screenshot or visualisation from your notebook

 

⚠️ Submission Rules at a Glance

  • All work must be completed within the challenge window
  • Submissions must be in English
  • One submission per team

Hackathon Sponsors

Prizes

5 non-cash prizes
1st Place
1 winner

Work published in the Celesta newsletter Β· Internship offer at Celesta Β· LOR from Celesta Β· Featured on all Celesta social media platforms Β· Personal LinkedIn recommendation from Celesta President Β· Certificate of Excellence

2nd Place
1 winner

Work published in the Celesta newsletter Β· Internship offer at Celesta Β· LOR from Celesta Β· Certificate of Excellence Β· Celesta social media feature

3rd Place
1 winner

Work published in the Celesta newsletter Β· Certificate of Excellence Β· LOR from Celesta Β· Celesta social media feature

Best Explainability Award: Submission that makes complex ML most understandable to a layman-audience.
1 winner

Certificate of Excellence Β· Celesta social media feature

Rising Star Award For the strongest solo participant, no team.
1 winner

Certificate of Excellence Β· Celesta social media feature

Devpost Achievements

Submitting to this hackathon could earn you:

Judges

Aradhya Haldikar
President | Celesta

Mrunal Sayam

Mrunal Sayam
Vice-President | Celesta

Arnav Venkatesh

Arnav Venkatesh
Technical Lead | Celesta

Judging Criteria

  • Model Performance (35%)
    F1-score, balance of precision and recall, and overall generalisability.
  • EDA & Data Handling (20%)
    Strategies for missing values and class imbalance, plus clear visual insights.
  • Feature Engineering (20%)
    Domain-aware derived features and creative feature combinations.
  • Explainability (15%):
    Feature importance and plain-language interpretation.
  • Write-up Quality (10%)
    Clarity, structure, and reproducibility.

Questions? Email the hackathon manager

Invite others to compete

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.