Principal Data Scientist
- Islamabad, Punjab, Pakistan
- Full-time
- Delivery
We are seeking a highly skilled Data Scientist (5–8 years of experience) to drive data-driven insights, develop predictive models, and build scalable data preparation pipelines. The ideal candidate will be proficient in Python or R, statistical analysis, and traditional machine learning techniques, with hands-on experience in leveraging Oracle data platforms for analytical workloads.
This role involves conducting exploratory data analysis (EDA), feature engineering, and model development, along with collaborating closely with engineering and business teams to deliver impactful and production-ready data science solutions.
Key Responsibilities
- Perform exploratory data analysis (EDA) to uncover trends, patterns, and anomalies in large datasets.
- Apply statistical methods and hypothesis testing to generate actionable insights.
- Design, build, and validate machine learning models (regression, classification, clustering, etc.).
- Develop scalable data preparation and feature engineering pipelines for model training.
- Optimize model performance through hyperparameter tuning, feature selection, and validation.
- Work with Oracle databases, Oracle SQL, and related tools to extract, transform, and analyze data.
- Collaborate with data engineers and BI teams to integrate models into business applications and reporting layers.
- Evaluate model outcomes using appropriate metrics and refine models for deployment.
- Stay abreast of emerging trends in data science, ML, and AI, continuously enhancing analytical methodologies.
- Present findings and recommendations to business stakeholders in a clear and actionable format.
Required Skills & Experience
- Bachelor’s or equivalent degree in Data Science, Computer Science, Information Technology, or a related field.
- 5–8 years of hands-on experience in data science, machine learning, or statistical analysis.
- Strong programming proficiency in Python or R for data analysis and model building.
- Solid understanding of machine learning algorithms (e.g., regression, decision trees, random forests, SVMs).
- Proficiency in statistical analysis, hypothesis testing, and probability theory.
- Experience in data preprocessing, feature engineering, and dimensionality reduction.
- Hands-on experience with ML libraries and frameworks (Scikit-learn, XGBoost, LightGBM, Statsmodels, etc.).
- Strong SQL skills, including experience working with Oracle Database and Oracle Analytics for data extraction and transformation.
- Experience handling large-scale structured and unstructured datasets.
- Exposure to model deployment, monitoring, or integration with enterprise systems.
- Excellent problem-solving, analytical, and communication skills with the ability to translate complex insights for business users.
Preferred Qualifications
- Experience working within Oracle data ecosystems (e.g., Oracle Autonomous Database, Oracle Data Science, or Oracle Analytics Cloud).
- Familiarity with big data technologies (Spark, Hadoop) for large-scale ML workloads.
- Experience with deep learning frameworks (TensorFlow, PyTorch).
- Exposure to MLOps, including CI/CD pipelines, model versioning, and lifecycle management.
- Understanding of A/B testing, experiment design, and data validation techniques.
We have an amazing team of 700+ individuals working on highly innovative enterprise projects & products. Our customer base includes Fortune 100 retail and CPG companies, leading store chains, fast-growth fintech, and multiple Silicon Valley startups.
What makes Confiz stand out is our focus on processes and culture. Confiz is ISO 9001:2015 (QMS), ISO 27001:2022 (ISMS), ISO 20000-1:2018 (ITSM), ISO 14001:2015 (EMS), ISO 45001:2018 (OHSMS) Certified. We have a vibrant culture of learning via collaboration and making workplace fun.
People who work with us work with cutting-edge technologies while contributing success to the company as well as to themselves.
To know more about Confiz Limited, visit: https://www.linkedin.com/company/confiz-pakistan/
