Data Scientist

New York, NY

Post Date: 08/09/2018 Job ID: 11127307 Industry: IT Perm
 

Job Description
We are looking for Data Scientists who are natural and relentless problem solvers and are passionate about working with Big Data from data wrangling to Spark programming to sophisticated modeling and machine learning on Hadoop. This full-time position involves data engineering and statistical modeling for both Big Data analytical frameworks as well as survey sourced data for the financial services industry.
Responsibilities
  • • Develop insights into customer behavior and introduce new approaches to transform complex behavioral data into actionable information using Big Data engineering and Machine Learning techniques
  • • Collaborate with modeling teams to drive development of new metrics, models and algorithms using, enhance data governance and drive variable standardization
  • • Leverage enhanced data mining and visualization techniques and understand different data sources relevant for a diverse set of data analysis requirements
  • • Assist in the analytic design of projects including defining issues, developing hypotheses, sample sizing and selection, suggesting appropriate analytical tools and techniques for various classes of problems
  • • Manage all aspects of the analysis including data compilation, programming and development of reports and presentation materials for project delivery
  • • Work closely with IT to build infrastructure to leverage new data sources to drive business growth
 
Desired Skills and Expertise
Candidates should have the following background, skills and characteristics:
  • • Advanced degree in a quantitative discipline required (minimum of master' s is preferred; bachelor' s degree with work experience will be considered)
  • • 2 years of quantitative modeling experience preferably using a distributed computing environment
 
 
  • • Strong knowledge of statistical and machine learning techniques, such as regression analysis, clustering, decision trees, collaborative filtering, k-nearest neighbors, support vector machines, association rules, and matrix factorization methods
  • • Working knowledge of R, Python, SAS and Java (bonus points for Scala) and proficiency in SQL
  • • Deep understanding of experiment design: Factorial experiment design, multivariate experiments, simple random sampling vs stratified, etc.
  • • Strong cross-functional communication skills to be able to explain your approach to a wide variety of both technical and non-technical stakeholders
  • • Ability to work well with others in a high-pressure environment
  • • Capable of carrying out multiple tasks or projects in parallel
  • • Good oral and written communication skills
  • • Familiarity with Hadoop, and Spark on YARN for working with terabyte-scale data

Not ready to apply?

Send an email reminder to:

Share This Job:

Related Jobs: