π India High School Exoplanet Data Challenge β Uncover New Worlds with AI
How do astronomers find planets light-years away?
They don't see them directly. They watch starlight β and look for the faint, telltale dips that betray a planet passing in front of its star.
Now, it's your turn.
We invite high school students across India to step into the role of an astrophysicist and data scientist. Using a curated dataset sourced directly from the NASA Exoplanet Archive, your mission is to build a machine learning classification model that separates real exoplanet candidates from noise and false signals β the exact kind of problem professionals at NASA and research institutions tackle every day.
π The Mission
You'll receive a starter CSV file with real observational data. From there, it's on you:
π Explore β understand distributions, correlations, and anomalies
π§Ή Clean β handle missing values and class imbalance thoughtfully
π€ Model β apply classification algorithms of your choice (Random Forest, XGBoost, Neural Networks β your call)
π Interpret β explain what your model learned and why it matters
No astrophysics background required. Just curiosity, code, and commitment :)
π― Why This Challenge Exists
Most student competitions give you toy datasets and synthetic problems. This one gives you real data from a real space telescope.
We designed this challenge to bridge the gap between classroom learning and genuine scientific work β and to give students a project that belongs on a college application, a research portfolio, or a GitHub that actually stands out.
This is applied AI for space science, and you don't have to wait until university to do it.
β±οΈ Duration: 1.5 months of self-paced, hands-on work.
π Prizes & Recognition
π₯ Top Solutions Published β Winning models and write-ups featured on our platform's newsletter and shared with the STEM community
π Certificates of Excellence β Awarded to all top-scoring participants
π Academic Credibility β A real, portfolio-ready project backed by NASA data β the kind that stands out to colleges and research programs
π€©Something exciting β To be soon announced!!
π₯ Eligibility
π Open to high school students (Grades 9β12) across India
π€ Solo participation or teams of up to 3 members
π Free to enter
Requirements
π Submission Requirements
To submit to the India High School Exoplanet Data Challenge, each team must provide all of the following:
β Required Deliverables
1. π» Code / Notebook
- A clean, well-commented Jupyter Notebook (.ipynb) or Python script (.py)
- Must be reproducible β someone else should be able to run it top to bottom without errors
- Upload to GitHub and share the public repository link on your Devpost submission
2. π Model Results
- A summary table of your model's performance metrics: Accuracy, Precision, Recall, F1-Score
- Confusion matrix (image or inline in notebook)
- Any visualisations supporting your findings (feature importance plots, ROC curves, etc.)
3. π Written Summary / Report
A written explanation (500β1000 words) covering:
β Your approach to EDA and data cleaning
β Why you chose your model(s)
β Key findings and what the model learned
β How you would explain your predictions to a non-technical audience
Can be included inside the notebook or as a separate PDF
4. ποΈ Devpost Project Page
- Fill in your project title, description, and team members on Devpost
- Embed or link your GitHub repository
- Upload at least one screenshot or visualisation from your notebook
β οΈ Submission Rules at a Glance
- All work must be completed within the challenge window
- Submissions must be in English
- One submission per team
Prizes
1st Place
Work published in the Celesta newsletter Β· Internship offer at Celesta Β· LOR from Celesta Β· Featured on all Celesta social media platforms Β· Personal LinkedIn recommendation from Celesta President Β· Certificate of Excellence
2nd Place
Work published in the Celesta newsletter Β· Internship offer at Celesta Β· LOR from Celesta Β· Certificate of Excellence Β· Celesta social media feature
3rd Place
Work published in the Celesta newsletter Β· Certificate of Excellence Β· LOR from Celesta Β· Celesta social media feature
Best Explainability Award: Submission that makes complex ML most understandable to a layman-audience.
Certificate of Excellence Β· Celesta social media feature
Rising Star Award For the strongest solo participant, no team.
Certificate of Excellence Β· Celesta social media feature
Devpost Achievements
Submitting to this hackathon could earn you:
Judges
Aradhya Haldikar
President | Celesta
Mrunal Sayam
Vice-President | Celesta
Arnav Venkatesh
Technical Lead | Celesta
Judging Criteria
-
Model Performance (35%)
F1-score, balance of precision and recall, and overall generalisability. -
EDA & Data Handling (20%)
Strategies for missing values and class imbalance, plus clear visual insights. -
Feature Engineering (20%)
Domain-aware derived features and creative feature combinations. -
Explainability (15%):
Feature importance and plain-language interpretation. -
Write-up Quality (10%)
Clarity, structure, and reproducibility.
Questions? Email the hackathon manager
Invite others to compete
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
