Loading

Harshit Sharma


Data Science Masters Student

Northeastern University (Boston)

About me

Designer by chance | Developer by choice

This is me!

Hey! Thanks for dropping by

I am a Data Science Masters student from Northeastern University, Boston. With 2.5+ years of experience as a Machine Learning Engineer, I have gained profound research and software development experience in building Deep Learning models to solve NLP problems.

I am also a co-author at my personal blog @https://intuitiveshorts.blogspot.com/ focusing on ML and Deep Learning topics. I am also a contributing writer on HackerNoon and Towards Data Science on Medium. I also like to sketch during my free time, which can be found @https://in.pinterest.com/harshit158/introverted-pixels/.

PROFILES

Technical Skills

Programming languages

Advanced

Python

Intermediate

C++

Machine Learning Tools

Web Development Frameworks

Databases

Cloud Platforms

Work Experience

Research Intern in AI and NLP

Big data Natural Language Processing ML Algorithms
The work at Surukam Analytics included multilabel classification of websites into 422 categories with training data of 20000 websites amounting to 10GB. Applied BOW model and improved traditional tf-idf approach by incorporating class-frequency. Deployed hybrid approach of three machine learning algorithms : Naive Bayes, KNN and Support Vector Machines. Also implemented Compliment Naive Bayes to mitigate the issue of skewness in the training data
Period: May 2015 - July 2015

This is me!

Captcha decoder

OCR system pipeline

Implemented complete machine learning pipeline for developing an OCR system to decode captchas.

This is me!

Texlens

Text Summarizer

• Developed Graph based Extractive Summarization tool based on the Eigenvector centrality in graph representation of sentences
• Feature matrix using Tf-Isf values was used to create Adjacency matrix with Inter-sentence cosine distance as the similarity metric

Blogs

Transfer Learning : Approaches and Empirical Observations

A good naive definition looks like : "It is the ability to transfer knowledge from one domain to another" . Technically, it is: " using weights trained on one setting (Task 1) to fit a model on another setting (Task 2)...".

Captchas into bits and pieces (1/3)

Cropping the region bounded by box Extracting individual characters def showGaps(image): separatorLine = image.copy() width,height = separatorLine.size pixels = np.asarray(separatorLine).transpose() draw = ImageDraw.Draw(separatorLine) for i in range(width) ...