Amisha Das
Passionate Data Scientist, Astroinformatician and Developer
Skills
Astroinformatics and Data Science
Machine Learning
Projects
Data Dashboards
Data Analysis Dashboards | Tableau, R/RStudio, Shiny
Data Reports
Reports | R/RStudio/R Markdown, LaTeX
Distributed ETL Streaming Pipeline for Astroinformatics
A production-style streaming ETL pipeline built for astroinformatics. The system ingests survey CSVs via Kafka, processes them with Spark Streaming, writes normalized tables to PostgreSQL (with pgAdmin), and exposes interactive analysis via Jupyter. The whole stack is reproducible with Docker Swarm and a Makefile for quick deployment.
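The normalization step can be illustrated with a minimal, standard-library-only sketch. It is not the project's actual Spark job: the column names (`object_id`, `ra`, `dec`, `band`, `magnitude`), the two-table layout, and the sample rows are all hypothetical, chosen only to show how flat survey rows might be split into normalized tables before loading.

```python
import csv
import io


def normalize_rows(csv_text):
    """Split flat survey rows into an objects table and an observations table.

    Column names here are assumptions for illustration, not the real schema.
    """
    objects = {}        # object_id -> (ra, dec), stored once per object
    observations = []   # one (object_id, band, magnitude) tuple per input row
    for row in csv.DictReader(io.StringIO(csv_text)):
        obj_id = row["object_id"]
        # Keep the first position seen for each object; later rows only add observations.
        objects.setdefault(obj_id, (float(row["ra"]), float(row["dec"])))
        observations.append((obj_id, row["band"], float(row["magnitude"])))
    return objects, observations


# Made-up sample data, used only to exercise the function.
SAMPLE = """object_id,ra,dec,band,magnitude
A1,10.5,-3.2,g,18.1
A1,10.5,-3.2,r,17.6
B2,200.0,45.0,g,20.3
"""

objects, observations = normalize_rows(SAMPLE)
```

In a streaming setting the same split would be applied per micro-batch, with each table written to its own PostgreSQL relation.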
AWS Cloud-Based Stroke Risk ETL & Analytics Pipeline
A cloud-native, end-to-end data engineering project designed to collect, transform, store, and analyze healthcare data related to stroke risk factors using AWS and Python. This project demonstrates how a scalable ETL pipeline can power public-health analytics by identifying trends, patterns, and risk correlations in real-world healthcare data.
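As a rough illustration of what a "risk correlation" could mean in the analytics stage, the sketch below computes a Pearson correlation coefficient between two numeric columns in plain Python. The project description does not say which statistic is used, so treating it as Pearson correlation is an assumption, and the function name `pearson` is hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)
```

A call such as `pearson(ages, glucose_levels)` (both names hypothetical) would return a value between -1 and 1, where values near 0 indicate no linear relationship.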
CNN to Classify Galaxies
A convolutional neural network (CNN) that classifies galaxy morphologies using the Galaxy10 dataset. The project uses TensorFlow to build and train the model and follows standard preprocessing steps to ensure efficient data handling and model performance.
Gametrax
Full-stack development of an Android app (built with Flutter/Dart, Firebase, and Figma) that helps gamers search, track, and organise games, view the latest gaming news, and check basic store info, all in one place.