with 4 to 7 years of experience to join our dynamic team. This role is pivotal in driving data-driven decision-making by transforming complex business problems into actionable technical solutions. The ideal candidate will leverage expertise in
R Programming Language
,
Python
,
Machine Learning
,
NLP
, and
Data Mining
to develop advanced predictive models and insightful analytics that directly impact quality, risk management, and operational efficiency within our organization.
Roles and Responsibilities:
Translate complex business challenges into clear, executable technical plans and communicate results effectively to stakeholders.
Perform rigorous
statistical analysis
to determine appropriate methodologies based on the problem context.
Design and implement
advanced feature engineering
techniques, including:
Creating predictive features from diverse datasets.
Extracting text-based features using
NLP
to convert unstructured data into numerical insights.
Developing time-series features such as rolling averages and control charts from sensor or operational data.
Constructing interaction features linking multiple datasets to uncover hidden patterns.
Build, validate, and deploy
machine learning
models (e.g.,
random forest
,
XGBoost
) to identify key drivers of quality defects and other critical outcomes.
NLP
techniques to analyze unstructured data sources such as customer complaints, risk management documents, and non-conformance descriptions.
Conduct advanced data manipulation and integration using
SQL
,
Python
, or
R
to prepare datasets for modeling and analysis.
Communicate findings using compelling
data visualization
to inform decision-making across teams.
Collaborate with cross-functional teams including R&D, manufacturing, and quality assurance to ensure data solutions align with business goals.
Maintain compliance with industry regulations and standards pertinent to highly regulated sectors such as pharmaceuticals, medical devices, or finance (preferred).
Support cloud-based data infrastructure initiatives using platforms like
AWS
,
Azure
, or
GCP
(advantageous).
Qualifications:
Bachelor's or Master's degree in Computer Science, Statistics, Mathematics, Engineering, or a related quantitative field.
4 to 7 years of professional experience in
data science
,
machine learning
, or related roles.
Proven expertise in
R Programming Language
and
Python
for data analysis and model development.
Strong background in
statistical analysis
and
data mining
techniques.
Hands-on experience with
NLP
applications on unstructured text data.
Demonstrated ability to build and deploy classification models such as
random forest
and
XGBoost
.
Proficiency in
SQL
and data wrangling using
Python
or
R
.
Experience in feature engineering for time-series and interaction features.
Excellent communication skills with ability to clearly articulate technical results to non-technical stakeholders.
Experience in regulated industries (pharma, medical device, finance) and cloud platforms (
AWS
,
Azure
,
GCP
) is a plus.
Tools and Technologies:
R Programming Language
Python
(including libraries such as pandas, scikit-learn, nltk/spacy for NLP)
SQL
for data querying and manipulation
Machine Learning frameworks (e.g.,
XGBoost
,
random forest
)
Natural Language Processing tools and techniques
Data visualization tools (e.g., Tableau, Power BI, matplotlib, seaborn)
Cloud platforms (
AWS
,
Azure
,
GCP
) - preferred
Join us as a Data Science Engineer and play a crucial role in harnessing the power of data to solve complex business problems end-to-end, driving innovation and quality improvements across the organization.
Skills
R Programming Language, Data Mining, Python, NLP, Machine Learning, Data Science
Beware of fraud agents! do not pay money to get a job
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.