Junior Gen AI Engineer - AWS Bedrock, Vertex AI
Patna, Bryckel AI
...partial re-execution. Optimize latency, throughput, and cost for long-context inference (batching, streaming, async execution). Build and scale OCR → document parsing → LLM inference pipelines for scanned leases (Textract). Develop streaming and async APIs using FastAPI. Manage distributed background workloads with Celery [...]
Category: IT & Telecommunications