AI & Enterprise AI9 July 20247 min read
Multimodal AI in the Enterprise — Where Vision Plus Text Earns Its Cost
GPT-4o, Claude 3, Gemini 1.5 brought capable multimodal models to the enterprise. The use cases that justify the cost are narrower than the demos suggest, but the ones that do justify it are worth investing in.
AI & Enterprise AI6 February 20249 min read
Intelligent Document Processing — From OCR to Understanding
Intelligent document processing has changed shape in the last eighteen months. A practitioner view of where the real work sits when LLMs join the pipeline — and why parsing still matters more than the model.