-
Data Scientist
- Capgemini (Austin, TX)
-
Sogeti is a leading provider of professional technology services, specializing in Application Management, Infrastructure Management and High-Tech Engineering. Sogeti offers cutting-edge solutions around Testing, Business Intelligence, Mobility, Cloud and Security, combining world class methodologies and the global delivery model, Rightshore®. Sogeti brings together more than 20,000 professionals in 15 countries and is present in over 100 locations in Europe, the US and India. Sogeti is a wholly-owned subsidiary of Cap Gemini S.A., listed on the Paris Stock Exchange.
At Sogeti USA, we are committed to building a long and enduring relationship with our employees and to creating an environment that rewards and empowers. Our mission is to constantly exceed our employees' expectations in the same way that we strive to exceed our clients' expectations. We offer an environment that celebrates innovation and helps you to achieve a good balance between your professional and personal life. We strive to be an employer of choice!
What You-ll Do
Establishing a single, end-to-end synaptic-based model (for
evaluation) and being prepared to hook it up for model deployment for online
testing
The candidate will come in having experience in modeling work at a production
level, this candidate will not be doing any research-related type of work.
System understanding + modeling
deliverables are key stills for this role. This person should have a sense of
things they build in preparation for production and be able to articulate their
ideas.
Dive into the data to understand its structure, volume, and any existing
preprocessing. This may involve:
Data Cleaning: Handling missing values, removing duplicates, and standardizing
formats.
Exploratory Data Analysis (EDA): Analyzing patterns and distributions,
assessing feature relevance, and identifying potential biases.
Familiarity with data nuances and readiness to create initial embeddings.
Generate embeddings, likely using pre-trained or fine-tuned language models.
Embeddings
for retrieval and have evaluation results, evaluating results of the model and
have that artifact ready for deployment understanding the data, and build
indexes
Minimum education qualification
+ Bachelor's Degree in Computer Science, Computer Engineering, MIS or related field.
The ideal candidate:
MUST HAVE:
+ **SQL- programming language - proficient**
+ **Java code - required**
+ **Python - expert**
+ **Code versioning software (Git)**
Machine learning:
+ Deep learning (including large language models (LLM) and/or computer vision)
+ Data pipeline engineering
+ Model deployment /development (write a model from scratch, infrastructurepipelining)
Nice to have:
+ Some experience in search (Lucene)
+ Some experience with Synaptic-based Language Modeling and Vector Databases(VectorDB)
The benefits our employees enjoy:
+ 401(k) Savings Plan- Matched 150% up to 6%. (Our 401k is in the top 1% of 401(k) plans offered in the US!)
+ Medical/Prescription/Dental/Vision Coverage!
+ Low-premium and deductible. Plan with free preventive care.
+ $12,000 in Tuition Reimbursement
+ 100% Company-paid mobile phone plan
+ Personal Time Off (PTO)- Ensuring a balance of work and home life
Please be aware that Capgemini Sogeti may capture your image (video or screenshot) during the interview process and that image may be used for verification, including during the hiring and onboarding process
Sogeti is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.
-
Recent Jobs
-
Data Scientist
- Capgemini (Austin, TX)
-
Senior DevOps Engineer - Hybrid, Minnesota
- Entrust (Shakopee, MN)