Profile Image
Verified

About Me

A Full Stack Data Scientist with more than 8 years of substantial experience in Big Data Engineering. Have 5.2 Years of experience in building Machine Learning models using state of the art techniques. Expert in dealing with real-world large data sets on Cloud and On-Premise environments. Expert in these skills - Machine Learning : Regression Techniques, Decision Trees, Random Forests, Neural Nets, Clustering Techniques, Predictive modelling, Statistical Analysis, Classification analysis. Bigdata Processing & Scripting : Hive, Spark, EMR, Hadoop, HDFS,Kafka, Spark Streaming NLP : NLTK, BERT, Transformers, Deep Learning, PyTorch, Tensorflow Cloud/On Prem Platforms : AWS, Google Cloud Platform, AZURE, Databricks, MapR, Hortonworks Database: Hbase, DB2, Redshift, SQL DW, ADW, Oracle, PostgreSQL Programming Language: Python, Linux, SQL, Scala Libraries : Python with Pandas, SciKit Learn, PySpark with SparkML, Keras, Spacy Workflow Scheduling : ADF, Azure Devops – VisualStudio, AWS Lambda, ECS, Docker Framework : Django, Flask

Experience

  • Sr. Data Scientist

    Research Firm
    Apr 2019 - Till Date

    Key Responsibilities: • Lead team of 2 Data Science Engineer to build Machine Learning & Deep Learning models for question answering on Open Based dataset. • Prospect Prioritization model based on the past usage of the customers. • Apply Natural Language Processing on client call transcripts to extract key concepts and themes being discussed, identify the intent, key question being asked and map the discussion topics to internal research taxonomy using Machine Learning Models. • Use Machine Learning to develop self-correcting algorithms to cluster topics being discussed in calls to draw meaningful insights. • NLP(BERT and ELMO) based tagging of Question of Clients to SME’s. • Articulating insights, models, mathematical and statistical concepts to non-technical stakeholders. Skills/Domain: Machine Learning, Deep Learning, PyTorch, R-CNN Dev & Prod Environment: GPUs, AWS

  • Sr. Data Scientist

    Consulting Firm
    Nov 2017 - Apr 2019

    Key Responsibilities: • Working with the team of Data Science Engineer to build end to end Machine Learning platform. • Design proofs of concepts (POC) to answer targeted business questions. • Building Machine Learning & Deep Learning models using Adobe clickstream data. • Creating spark-based workflows for Data processing and feature Engineering. • Scaling the solutions for all 35 markets (different geos). • Articulating insights, models, mathematical and statistical concepts to non-technical stakeholders. • Building Recommendation Engine based on the Item-Item similarity. • Predicting the customers who are likely to be converted using Random Forest and XGBoost. • Preparing data flow pipeline in python for processing and feature Engineering. • Used python's Scikit learn for building classification model. • Running campaign’s to reach out to potential customers. Skills/Domain: Machine Learning, Deep Learning, Python, Apache Spark, Databricks Dev & Prod Environment: Microsoft Azure platform, HDInsight, DWH, Datalake

  • Sr. Data Scientist

    Service based firm
    Sep 2013 - Nov 2017

    Key Responsibilities: • Building the PySpark based pipeline for data processing and wrote Hive Queries to fetch the data from CornerStone. • Worked on K-Means clustering algorithms to detect and avoid fraud transactions. • Developed the Python code in Spark environment to produce the desired results for the customers, using which customers make business decisions. • Experienced with the Spark improving the performance and optimization of the algorithms in Hadoop using Spark Context, Spark-SQL, Data Frame, and Pair RDD's. • Responsible for design & development of Spark SQL Scripts using Python. Skills/Domain: Machine Learning, Python, Apache Spark Dev & Prod Environment: MAPR, CornerStone, CRON

Education

  • Master Of Engineering – Data Science

    BITS Pilani
    Jul 2018 - Jul 2020

  • Bachelor of Engineering

    University of Pune
    Jul 2008 - Jul 2012

Loading...

Loading...

Loading...

Loading...