-
Senior Data Architect - AI-Powered Data Platforms
- GE Vernova (Niskayuna, NY)
-
Job Description Summary
We are seeking an experienced Data Architect who specializes in modernizing enterprise data platforms for the AI era. This role requires someone who deeply understands both traditional data architectures and the emerging requirements of AI systems, with expertise in bridging existing data lakes to support modern AI capabilities like RAG (Retrieval-Augmented Generation), vector search, and multi-modal AI applications. You'll be the architect who transforms our wealth of structured and unstructured data assets into AI-ready infrastructure.
The ideal candidate will have 10+ years of experience with enterprise data platforms and proven expertise in handling both structured and unstructured data at scale. You understand the complexities of existing data lake architectures and can architect the evolution path to support AI workloads without disrupting current operations.
As a GE Vernova accelerator, GE Vernova Advanced Research is driving strategy and leading research & development efforts to execute on the business's mission to help power the energy transition. We forge the collaborations and help invent the technologies required to electrify and decarbonize for a zero-carbon future.
Representing virtually every major scientific and engineering discipline, our researchers are collaborating with GE Vernova's businesses, the U.S. government, and more than 420 entities at the forefront of technology to execute on 150+ energy-focused projects. Collectively, these research programs and initiatives aim to solve near term technical challenges, deliver next generation product advances, and drive long term breakthrough innovation to enable more affordable, reliable, sustainable, and secure energy.
Job Description
Unstructured Data & AI Enablement:
+ Design scalable architectures for processing and indexing unstructured data (PDFs, documents, emails, logs, images) for AI consumption
+ Architect document processing pipelines that leverage multi-modal LLMs (GPT-4V, Claude, Gemini) for direct document understanding without traditional OCR preprocessing
+ Implement intelligent document extraction using LLMs' native vision and context capabilities to handle complex layouts, tables, and mixed media
+ Design metadata extraction and enrichment pipelines that enhance discoverability of unstructured assets
+ Build architectures for multi-modal AI applications that combine structured and unstructured data sources
RAG & Knowledge Platform Architecture:
+ Design end-to-end RAG architectures that leverage existing data lakes and enterprise knowledge bases
+ Architect hybrid search systems combining traditional keyword search with semantic/vector search capabilities
+ Implement chunking strategies and embedding pipelines for diverse document types and data sources
+ Build architectures for continuous learning where RAG systems are updated with new data in near real-time
+ Design security and access control models that work across legacy systems and modern AI platforms
+ Create data governance frameworks that ensure compliance while enabling AI innovation
Platform Optimization & Scale:
+ Optimize storage strategies for cost-effective management of structured and unstructured data
+ Design tiered storage architectures that balance performance needs with storage costs
+ Implement caching layers for frequently accessed embeddings and AI model inputs
BASIC QUALIFICATIONS
+ Bachelor's degree in Computer Science, Information Systems, or related field
+ 10+ years of experience as a Data Architect, Data Platform Engineer, or similar role with enterprise data systems
+ 5+ years of experience working with both structured (SQL databases, data warehouses) and unstructured data (documents, logs, multimedia)
+ Understanding of modern document processing using multi-modal LLMs and traditional extraction methods
+ Proficiency in Python and SQL, with experience in data processing libraries
+ Legal authorization to work in the U.S. is required. We will not sponsor individuals at the Bachelor’s level for employment visas, now or in the future, for this job opening.
+ Must be 18 years or older.
+ You must submit your application for employment on the careers page at www. (https://wd5.myworkday.com/ge/d/inst/15$165509/www.gecareers.com) careers.gevernova.com to be considered.
PREFERRED QUALIFICATIONS
+ 12+ years of experience modernizing legacy data architectures for cloud and AI workloads
+ Deep expertise in unstructured data processing using both multi-modal LLMs and traditional methods
+ Experience with multi-modal LLMs for document understanding and their cost/performance trade-offs
+ Background in information retrieval, search engineering, or content management systems
+ Experience with multi-modal AI architectures combining text, image, and structured data
+ Master's degree in Computer Science, Information Systems, or related field
Technical Stack
**Document Processing:** Multi-modal LLMs (GPT-4V, Claude Vision, Gemini), LlamaParse, Unstructured.io, Azure Document Intelligence, AWS Textract (for legacy/high-volume), direct PDF-to-context pipelines
**Vector/Search:** Pinecone, Weaviate, pgvector
**Lake Technologies:** AWS S3, Azure ADLS
**Languages:** Python, SQL, Scala, Java
**APIs:** OpenAI, Anthropic, Google Vertex AI, AWS Bedrock, Azure OpenAI
The salary range for this position is $145,000 - $242,000 USD, annually. The specific salary offered to a candidate may be influenced by a variety of factors including the candidate’s experience, their education, and the work location. This position is also eligible for a performance bonus. This position will remain posted until at least October 5th, 2025.
GE provides a comprehensive benefits package that provides access to plans which support the overall wellbeing of our employees and their dependents. These benefits include, but are not limited to, health care coverage (medical, dental, vision, pharmacy), a retirement plan that includes Company Retirement Savings and a 401K with Company matching, Life Insurance options, Disability coverage, paid time-off, EAP, and more.
GE Vernova offers a great work environment, professional development, challenging careers, and competitive compensation. GE Vernova is an Equal Opportunity Employer (https://www.eeoc.gov/sites/default/files/2022-10/22-088\_EEOC\_KnowYourRights\_10\_20.pdf) . Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.
GE Vernova will only employ those who are legally authorized to work in the United States for this opening. Any offer of employment is conditioned upon the successful completion of a drug screen (as applicable).
**Relocation Assistance Provided:** Yes
GE Vernova is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.
-
Recent Searches
- Analyst Performance Monitoring Quality (United States)
- Development Coordinator Foundations Government (United States)
- Neuroscience Spine Service Line (Texas)
Recent Jobs
-
Senior Data Architect - AI-Powered Data Platforms
- GE Vernova (Niskayuna, NY)