Responsibilities
- Explore and examine data from multiple diverse sources.
- Perform conceptual modeling, statistical analysis, and predictive modeling.
- Conduct data cleanup, normalization, and transformation.
- Develop hypotheses and test them with careful experiments.
- Build workflows for Extraction, Transformation, and Loading (ETL) of data.
- Ensure the integrity and security of data.
Qualifications
- Bachelor’s or Master’s degree in Data Science, Mathematics, or Computer Science.
- Knowledge of Probability Theory, Inference, and Linear Algebra.
- Experience in Classification, Prediction, and Clustering tasks.
- Strong skills in Python and packages like NumPy, pandas, scikit-learn, TensorFlow, PyTorch.
- Ability to write clear and concise code.
- Familiarity with Transformers and LangChain is a plus.