Assist in identifying, collecting, and extracting structured and unstructured data from various sources (e.g., databases, APIs, log files) using tools like
SQL
.
Data Cleaning and Preprocessing:
Clean, transform, and preprocess raw data to ensure quality, consistency, and suitability for analysis. This involves handling missing values, outliers, and data standardization.
Exploratory Data Analysis (EDA):
Conduct initial
Exploratory Data Analysis
(EDA) using statistical methods and programming languages like
Python
or
R
to uncover patterns, identify trends, and understand data distributions.
Model Development Support
Algorithm Application:
Apply fundamental
Machine Learning
algorithms (e.g., linear regression, classification, clustering) to solve defined business problems.
Model Training and Testing:
Assist in training, testing, and evaluating the performance of basic predictive models.
Statistical Analysis:
Perform statistical hypothesis testing and basic quantitative analysis to draw conclusions from data.
Communication and Collaboration
Visualization and Reporting:
Create clear and concise
data visualizations
(e.g., charts, graphs, dashboards using tools like Tableau or Matplotlib) to communicate findings to technical and non-technical stakeholders.
Cross-functional Collaboration:
Work closely with data analysts, data engineers, and business teams to understand project requirements and align data solutions with business objectives.
Problem-Solving:
Utilize a curious, analytical, and
problem-solving mindset
to propose and investigate data-driven solutions to business challenges.
Expected skillset
Bachelor's or Master's degree in Computer science, Data Science or related field
Familiarity with cloud platforms for scalable data processing and storage, such as Microsoft Azure or AWS
Proficiency in Python libraries; Pandas (data manipulation), Numpy (numerical operations), Sci-kit learning (machine learning) and SQL for extracting data.
Understanding and implementing common ML algorithms (e.g., linear/logistic regression, decision trees, clustering).
Train machine learning and deep learning models using appropriate algorithms (linear, tree-based, boosting, neural nets, time-series & NLP methods).
A strong grasp of statistical concepts (e.g., hypothesis testing, probability distributions, regression analysis) to interpret data and validate models.
Validate via cross-validation/backtesting and perform robustness, bias, and fairness checks.
Write tests for data pipelines, deploy and productionize models (Docker, APIs), and maintain model registry/versioning.
Creating effective charts, graphs, and dashboards using tools like Matplotlib, Seaborn, Tableau, or Power BI to communicate insights.
Job Type: Fresher
Pay: ₹10,000.00 - ₹12,500.00 per month
Work Location: In person
Beware of fraud agents! do not pay money to get a job
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.