Classified Reddit conversations into 8 news groups using classifiers such as Naive Bayes, SVM and Decision Trees. Our team placed first on the Kaggle leaderboard for this competition.
Technologies used: Python with scikit-learn, NumPy, SciPy and Pandas.
View on GitHub
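As an illustration of the Naive Bayes approach mentioned above, here is a minimal from-scratch sketch that uses only the Python standard library. It is not the scikit-learn pipeline from the project: the function names, the two class labels and the toy sentences are all invented for this example, and a real run would use the competition data and proper feature extraction.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(docs, labels):
    """Fit a multinomial Naive Bayes model on tokenized documents."""
    class_counts = Counter(labels)        # number of documents per class
    word_counts = defaultdict(Counter)    # word frequencies per class
    vocab = set()
    for tokens, label in zip(docs, labels):
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def predict(model, tokens):
    """Return the class with the highest log-posterior for the tokens."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in class_counts.items():
        score = math.log(n_docs / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for tok in tokens:
            if tok not in vocab:
                continue  # ignore words never seen in training
            # Laplace (add-one) smoothing avoids zero probabilities
            score += math.log(
                (word_counts[label][tok] + 1) / (total_words + len(vocab))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy data, invented for illustration only
docs = [
    "the team won the game last night".split(),
    "great goal in the final match".split(),
    "new gpu benchmarks for the latest laptop".split(),
    "the phone update fixes a software bug".split(),
]
labels = ["sports", "sports", "technology", "technology"]

model = train_naive_bayes(docs, labels)
prediction = predict(model, "who won the match".split())
```

Working in log space keeps the product of many small probabilities from underflowing, which matters once documents are longer than a few words.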
Predicted the marathon winner by training models with Linear Regression, Naive Bayes and Logistic Regression. I devised several methods to transform the numerical, continuous data into categorical data, which helped us reach a more general solution across the range of years of participation recorded for each participant. The runner our team predicted as the winner finished as first runner-up in the actual 2017 marathon.
Technologies used: Python with scikit-learn, NumPy, SciPy and Pandas.
View on GitHub
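One common way to turn a continuous value into a category is to bin it against fixed cut points. The sketch below shows that idea for years of participation. The cut points, the category names and the helper `to_category` are hypothetical, chosen only for illustration; the project's actual transformations are not described here.

```python
from bisect import bisect_right


def to_category(value, edges, names):
    """Map a continuous value to a named bin.

    edges are sorted cut points. A value equal to a cut point falls in
    the bin to its right, so len(names) must equal len(edges) + 1.
    """
    if len(names) != len(edges) + 1:
        raise ValueError("need exactly one more name than edges")
    return names[bisect_right(edges, value)]


# Hypothetical bins for years of participation:
# < 1, 1 to 2, 3 to 5, and 6 or more
YEAR_EDGES = [1, 3, 6]
YEAR_NAMES = ["first-timer", "occasional", "regular", "veteran"]

years = [0, 1, 2, 5, 6, 12]
categories = [to_category(y, YEAR_EDGES, YEAR_NAMES) for y in years]
```

Binning like this lets a model treat participants with, say, 7 and 12 years of history the same way, which is one route to the kind of generalization described above.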
Implemented feed-forward neural networks (FFNN) from scratch to classify images into 40 categories. Our team experimented with numerous preprocessing methods and kept the ones that held up under rigorous validation of each transformation's effect on the data and the predictions. Alongside the FFNN, we used a deep CNN to classify the images, which achieved an accuracy of 90%.
Technologies used: Python with scikit-learn, NumPy, Keras and Pandas.
View on GitHub
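To give a sense of what "from scratch" involves, here is a minimal sketch of the forward pass of a feed-forward network with one hidden layer and a 40-way softmax output, written in plain Python. Only the 40 output classes come from the description above. The input size, hidden size, initialization and random stand-in image are assumptions, and training by backpropagation is left out.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Create an (n_out x n_in) weight matrix and a zero bias vector."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, layer):
    """Affine transform: W x + b."""
    weights, biases = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, biases)]


def relu(v):
    """Element-wise rectified linear unit."""
    return [max(0.0, a) for a in v]


def softmax(v):
    """Numerically stable softmax: subtract the max before exponentiating."""
    m = max(v)
    exps = [math.exp(a - m) for a in v]
    total = sum(exps)
    return [e / total for e in exps]


def forward(x, hidden, output):
    """One hidden ReLU layer followed by a softmax output layer."""
    return softmax(dense(relu(dense(x, hidden)), output))


rng = random.Random(0)
n_pixels, n_hidden, n_classes = 64, 16, 40  # 64 and 16 are placeholder sizes
hidden = init_layer(n_pixels, n_hidden, rng)
output = init_layer(n_hidden, n_classes, rng)

image = [rng.random() for _ in range(n_pixels)]  # stand-in for a flattened image
probs = forward(image, hidden, output)
predicted_class = max(range(n_classes), key=probs.__getitem__)
```

With untrained weights the prediction is arbitrary; the point is only the shape of the computation that training would then tune.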
Privacy Preservation in Machine Learning Open Source
Surveyed and conducted a comparative study of differential privacy techniques for protecting sensitive data. Applied the studied techniques to image data sets such as MNIST, SVHN and CIFAR-10.
Technologies used: Python with scikit-learn, NumPy and Pandas.
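The description does not name the specific techniques that were compared, so the sketch below shows only the textbook building block of differential privacy, the Laplace mechanism, applied to a counting query. The records, the helper names and the choice of epsilon are all made up for illustration.

```python
import random


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_count(values, predicate, epsilon, rng):
    """Release a count with epsilon-differential privacy.

    A counting query has sensitivity 1: adding or removing one record
    changes the true count by at most 1, so the noise scale is 1 / epsilon.
    """
    true_count = sum(1 for v in values if predicate(v))
    return true_count + laplace_noise(1.0 / epsilon, rng)


rng = random.Random(42)
ages = [23, 35, 41, 29, 52, 61, 38]  # made-up records
noisy = private_count(ages, lambda a: a >= 40, epsilon=0.5, rng=rng)
```

A smaller epsilon means more noise and stronger privacy; a larger epsilon gives an answer closer to the true count of 3.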
Building a Compiler for the Go Language Open Source
Built a compiler, written in C, for the Go language ("GoLang"). I used flex and bison to create the lexer and parser, and optimized the compiler using pattern matching.
Technologies used: C, GoLang, Flex and Bison.
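The project itself used flex and bison with C. Purely to illustrate what the lexing stage does, here is a small regular-expression tokenizer in Python for a tiny, hypothetical subset of Go. The token names, the keyword list and the operator set are assumptions and do not reflect the project's actual grammar.

```python
import re

# Token patterns for a tiny, hypothetical subset of Go
TOKEN_SPEC = [
    ("NUMBER", r"\d+"),
    ("IDENT", r"[A-Za-z_]\w*"),
    ("OP", r":=|==|[+\-*/=(){}]"),
    ("SKIP", r"\s+"),
    ("MISMATCH", r"."),
]
KEYWORDS = {"func", "var", "return", "if", "else", "for", "package"}
MASTER = re.compile(
    "|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKEN_SPEC)
)


def tokenize(source):
    """Yield (kind, text) pairs, much as a generated scanner emits tokens."""
    for match in MASTER.finditer(source):
        kind, text = match.lastgroup, match.group()
        if kind == "SKIP":
            continue  # whitespace carries no meaning here
        if kind == "MISMATCH":
            raise SyntaxError(f"unexpected character {text!r}")
        if kind == "IDENT" and text in KEYWORDS:
            kind = "KEYWORD"  # reserved words are matched as identifiers first
        yield kind, text


tokens = list(tokenize("var x = 3 + y"))
```

A parser would then consume this token stream according to the grammar rules, which is the part bison handled in the project.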
Summarized restaurant reviews using phrase-based semantic analysis, an NLP technique, and predicted user ratings from the summarized semantics of the reviews.
Technologies used: Python with scikit-learn, NumPy, NLTK and Pandas.
View on GitHub
Validated the claims made by the authors of the @tComment tool and re-implemented the technique, achieving a 10% improvement in accuracy over the original tool.
Technologies used: Java
View on GitHub
Summarizing documents Open Source
Generated automatic document summaries with 75% similarity to human-created summaries.
Technologies used: Python with scikit-learn, NumPy and NLTK.
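The similarity measure behind the 75% figure is not specified. One simple way such a comparison can be made is clipped unigram recall, in the style of ROUGE-1 recall; the sketch below is an assumption about how one might measure it, not the project's metric. The two sentences are invented.

```python
from collections import Counter


def unigram_recall(candidate, reference):
    """Fraction of reference words that also appear in the candidate.

    Counts are clipped, so a word repeated in the candidate is credited
    at most as many times as it occurs in the reference.
    """
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    if not ref:
        return 0.0
    overlap = sum(min(count, cand[word]) for word, count in ref.items())
    return overlap / sum(ref.values())


human = "the storm closed schools across the city"
generated = "the storm closed many schools"
score = unigram_recall(generated, human)  # 4 of 7 reference words matched
```

Recall rewards covering the reference; a fuller evaluation would also look at precision so that long, padded summaries are not favored.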
Hostel Allocation Portal Open Source
Built an application that automated hostel allocation at my college, UIET, Panjab University. It managed allocations for a total of 16 hostels. Students could get the hostel forms online and check the hostel waiting lists.
Technologies used: JSP, Servlets, MySQL and AJAX.
Imperial Finance Open Source
Built an application that gives consumers a way to get loans from multiple banks through a single interface. It handled customers' personal information, the initiation and tracking of loan requests, and monthly reports for the administrator.
Technologies used: JSP, Servlets, MySQL and AJAX.