Large language model performance matrix
Learn how different models perform on different tasks in Elastic Security.
This table describes the performance of various LLMs for different use-cases in Elastic Security, based on our internal testing. To learn more about these use-cases, refer to Attack discovery or AI Assistant.
Feature: | Model | |||||
---|---|---|---|---|---|---|
Claude 3: Opus | Claude 3: Sonnet | Claude 3: Haiku | GPT-4o | GPT-4 Turbo | GPT-4 32K | |
Assistant: general | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent |
Assistant: ES|QL generation | Great | Great | Poor | Excellent | Poor | Excellent |
Assistant: alert questions | Excellent | Excellent | Excellent | Excellent | Poor | Good (limited context) |
Attack Discovery | Excellent | Great | Poor | Poor | Good | Good (limited context) |