-
Nearshore- Data Scientist
- Insight Global (Houston, TX)
-
Job Description
- AI Engineer, Data Engineer/Scientist
- AI engineering/data engineering space, Python, SQL, Cloud
- Supports projects within data analytics and machine learning.
- Extensive background in Machine Learning, Python, and Pandas.
CERTIFICATES, LICENSES, REGISTRATIONS
- Preferred: Databricks Certified Associate Developer for Apache Spark, AWS Certified Solutions Architect, or other relevant certifications.
We are a company committed to creating inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity employer that believes everyone matters. Qualified candidates will receive consideration for employment opportunities without regard to race, religion, sex, age, marital status, national origin, sexual orientation, citizenship status, disability, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to [email protected] . The EEOC "Know Your Rights" Poster is available here (https://www.eeoc.gov/sites/default/files/2023-06/22-088\_EEOC\_KnowYourRights6.12ScreenRdr.pdf) .
To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/ .
Skills and Requirements
1. Exploratory Data Analysis (EDA)
Proficiency in Pandas, NumPy, Matplotlib, Seaborn, and Plotly
Strong understanding of data profiling, missing value treatment, and outlier detection
Experience with hypothesis testing, statistical summaries, and correlation analysis
Skilled in data visualization for both technical and non-technical stakeholders
Demonstrated business sense with the ability to understand domain problems and incorporate insights into solution design
Proven ability to derive actionable insights from raw and messy data
2. Data Science Solution Design
Proficient in machine learning frameworks (e.g., Scikit-learn, XGBoost, DNN)
Proficient in building end-to-end ML pipelines: data ingestion, preprocessing, modeling, evaluation, and deployment
Familiarity with feature engineering
Experience with time series analysis or spatial data
3. Full Stack & Engineering Skills
Knowledge of SQL and data modeling
Familiar with Databricks and Pyspark
Familiarity with Python-based APIs (e.g., Flask, FastAPI)
Experience with cloud platforms (e.g., AWS, GCP, Azure)
Version control using Git
4. Preferred
Experience working with multi-agent systems, adaptive learning, or optimization problems null
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal employment opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment without regard to race, color, ethnicity, religion,sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military oruniformed service member status, or any other status or characteristic protected by applicable laws, regulations, andordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to [email protected].
-