santomar

About Me

A Full Stack Data Scientist with more than 8 years of experience in Big Data Engineering and 5.2 years of experience building Machine Learning models using state-of-the-art techniques. Expert in working with large real-world data sets in cloud and on-premise environments.

Skills:
Machine Learning: Regression Techniques, Decision Trees, Random Forests, Neural Nets, Clustering Techniques, Predictive Modelling, Statistical Analysis, Classification Analysis
Big Data Processing & Scripting: Hive, Spark, EMR, Hadoop, HDFS, Kafka, Spark Streaming
NLP: NLTK, BERT, Transformers, Deep Learning, PyTorch, TensorFlow
Cloud/On-Prem Platforms: AWS, Google Cloud Platform, Azure, Databricks, MapR, Hortonworks
Databases: HBase, DB2, Redshift, SQL DW, ADW, Oracle, PostgreSQL
Programming Languages: Python, Linux, SQL, Scala
Libraries: Python with Pandas, scikit-learn, PySpark with SparkML, Keras, spaCy
Workflow Scheduling: ADF, Azure DevOps – Visual Studio, AWS Lambda, ECS, Docker
Frameworks: Django, Flask

My Services

Data Science/Machine Learning/NLP/Deep Learning Interviewer and Trainer


Experience

Sr. Data Scientist

Research Firm

Apr 2019 - Present

Key Responsibilities:
• Lead a team of 2 Data Science Engineers to build Machine Learning & Deep Learning models for question answering on an Open Based dataset.
• Build a prospect prioritization model based on customers' past usage.
• Apply Natural Language Processing to client call transcripts to extract the key concepts and themes being discussed, identify the intent and the key question being asked, and map the discussion topics to the internal research taxonomy using Machine Learning models.
• Use Machine Learning to develop self-correcting algorithms that cluster the topics discussed in calls to draw meaningful insights.
• Tag client questions to SMEs using NLP models (BERT and ELMo).
• Articulate insights, models, and mathematical and statistical concepts to non-technical stakeholders.
Skills/Domain: Machine Learning, Deep Learning, PyTorch, R-CNN
Dev & Prod Environment: GPUs, AWS

Sr. Data Scientist

Consulting Firm

Nov 2017 - Apr 2019

Key Responsibilities:
• Working with a team of Data Science Engineers to build an end-to-end Machine Learning platform.
• Designing proofs of concept (POCs) to answer targeted business questions.
• Building Machine Learning & Deep Learning models using Adobe clickstream data.
• Creating Spark-based workflows for data processing and feature engineering.
• Scaling the solutions to all 35 markets (different geos).
• Articulating insights, models, and mathematical and statistical concepts to non-technical stakeholders.
• Building a recommendation engine based on item-item similarity.
• Predicting which customers are likely to convert using Random Forest and XGBoost.
• Preparing a data flow pipeline in Python for processing and feature engineering.
• Using Python's scikit-learn to build a classification model.
• Running campaigns to reach out to potential customers.
Skills/Domain: Machine Learning, Deep Learning, Python, Apache Spark, Databricks
Dev & Prod Environment: Microsoft Azure platform, HDInsight, DWH, Datalake

Sr. Data Scientist

Service based firm

Sep 2013 - Nov 2017

Key Responsibilities:
• Built a PySpark-based pipeline for data processing and wrote Hive queries to fetch data from CornerStone.
• Worked on K-Means clustering algorithms to detect and prevent fraudulent transactions.
• Developed Python code in the Spark environment to produce the results that customers use to make business decisions.
• Used Spark to improve the performance and optimization of algorithms in Hadoop using SparkContext, Spark SQL, DataFrames, and pair RDDs.
• Responsible for the design and development of Spark SQL scripts using Python.
Skills/Domain: Machine Learning, Python, Apache Spark
Dev & Prod Environment: MapR, CornerStone, CRON

Education

Master Of Engineering – Data Science

BITS Pilani

Jul 2018 - Jul 2020

Bachelor of Engineering

University of Pune

Jul 2008 - Jul 2012
