Sanjay Natraj: Data Scientist/Hadoop Engineer

Uploaded by

Pallav Anand

Sanjay Natraj

Data Scientist/Hadoop Engineer

Chicago, IL - Email me on Indeed: indeed.com/r/sanjay-natraj/375b258f28552492
Willing to relocate: Anywhere
Authorized to work in the US for any employer

WORK EXPERIENCE

Data Scientist
Data Science - March 2016 to March 2016
Working on enhancements to Pig scripts to include more topics.
Working on creating Pig UDFs to process user cookie data.
Working on table interaction from Hive into Pig using HCatalog.
Maintain all the Java code created for Pig UDFs in a Git repository.
Use Maven to build, compile, and deploy code.
Created UDFs in Python for Hive queries.
Developed a POC in Spark to query the dataset.
Worked on data ingestion from Oracle to HDFS using Sqoop.
Schema design for HBase.
Implemented a Java API to fetch data from HBase based on the row key design.
Wrote Scala code to perform data analysis using RDDs.
Worked on complex data structures (arrays, structs, arrays of structs, maps, arrays of maps) in Hive.

Data Scientist
Data Science, EMC Corporation - Hopkinton, MA - June 2012 to August 2015
Worked with the CTO office in designing an analytics-driven recommendation engine for the healthcare industry.
Experience in dealing with Apache Hadoop components and related tools like HDFS, MapReduce, HiveQL, HBase, Pig,
Sqoop, Oozie, Mahout, Cassandra, and MongoDB, and with Big Data and Big Data Analytics.

EDUCATION

Custom UDFs
Secondary Namenode

ADDITIONAL INFORMATION
SKILLS
Languages: Java, J2EE, HTML5, CSS3, JavaScript, C, C++, R
Big Data Tools:
Hadoop distributions, MapReduce, YARN, Hive, HBase, Sqoop, Pig, Oozie, ZooKeeper, R,
RStudio, R Commander, MATLAB, Git, Sublime, SVN, JBoss Drools, Tomcat, ETL, Spark,
NoSQL, HDFS, Elasticsearch, TensorFlow.
IDE / Tools / Framework: NetBeans, Eclipse, PuTTY, Cygwin, Git, Maven, JIRA, Jenkins, SoapUI
Database: Oracle 9i/10g, SQL Server, MySQL

PROJECTS
Handwritten digit classification using an artificial neural network and logistic regression (Python)
o Implemented a multilayer perceptron neural network with feed-forward and backpropagation units and evaluated its performance in classifying digits from the MNIST dataset.
o Achieved 98% accuracy through regularization of the weights and tuning of the network's hyperparameters.
o Also developed a multiclass logistic regression classifier for handwritten digits on the same dataset, which reported an accuracy of 92.4%.
Convolutional neural network (CNN) for NLP using Google's TensorFlow (Python)
o Implemented a CNN and tested its performance in classifying movie reviews on datasets acquired from Rotten Tomatoes.
o Achieved an accuracy of 74% and loss below 10% by tweaking model hyperparameters.
o Accuracy could be improved by increasing training epochs and batch sizes over a much larger dataset.
Time series forecasting of stocks using datasets from NASDAQ (R)
o Computed time series forecasts of the data at the Center for Computational Research (CCR) using a linear regression model, the Holt-Winters model, and an ARIMA model.
o Evaluated the mean absolute error (MAE) for the three models and calculated the stocks with minimum price for all three models.
Comparative analysis of Pig, Hive, and MapReduce in stock volatility calculation using NASDAQ data (SQL, Java)
o Implemented the business logic for calculating the least volatile stocks using Pig, Hive, and Java MapReduce.
o Scaled the Pig, Hive, and MapReduce jobs incrementally over core counts ranging from 1 to 48.
o Compared the performance of Pig and Hive against MapReduce on markers such as code complexity, running time, and ease of use.
Storage and analysis of Twitter data using Accumulo (Python, Java)
o Acquired Twitter data for 30 NBA teams using the OAuth API and stored it in Accumulo.
o Executed a MapReduce job on top of it to determine the popularity count of NBA teams in the data sample.
