General information
We recommend you do the following when using direct query:
- Use the COALESCE SQL function to handle missing columns and ensure results are returned.
- Use limits on your queries to ensure you aren't pulling too much data back.
- If you plan to analyze the same dataset many times, create an indexed view to fully ingest and index the data into OpenSearch, and drop it when you have completed the analysis.
- Drop acceleration jobs and indexes when they're no longer needed.
- Queries containing field names that are identical but differ only in case (such as field1 and FIELD1) are not supported. For example, the following queries are not supported:
Select AWSAccountId, AwsAccountId from LogGroup
Select a.@LogStream, b.@logStream from Table A INNER Join Table B on a.id = b.id
However, the following query is supported because the field name (@logStream) is identical in both log groups:
Select a.@logStream, b.@logStream from Table A INNER Join Table B on a.id = b.id
- Functions and expressions must operate on field names and be part of a SELECT statement with a log group specified in the FROM clause. For example, this query is not supported:
SELECT cos(10) FROM LogGroup
This query is supported:
SELECT cos(field1) FROM LogGroup
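The first two recommendations above can be sketched in a single query. The log group and field names here are hypothetical:

```sql
-- Hypothetical log group and field names. COALESCE supplies a fallback
-- value when a field is missing from a record, and LIMIT caps the
-- number of rows returned.
SELECT COALESCE(statusCode, 0) AS status_code,
       COALESCE(`@message`, 'no message') AS message
FROM `LogGroup-A`
LIMIT 100;
```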
Information for Amazon S3
If you’re using Amazon OpenSearch Service to direct query data in Amazon S3, we also recommend the following:
- Ingest data into Amazon S3 using partition formats of year, month, day, and hour to speed up queries.
- When you build skipping indexes, use Bloom filters for fields with high cardinality and min/max indexes for fields with large value ranges. For high-cardinality fields, consider using a value-based approach to improve query efficiency.
- Use Index State Management to maintain storage for materialized views and covering indexes.
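As a sketch of the year/month/day/hour partition layout mentioned above, objects might land under key prefixes like the following (the bucket, prefix, and object names are hypothetical):

```
s3://example-bucket/app-logs/year=2025/month=06/day=15/hour=09/part-00000.json.gz
```

Hive-style key=value prefixes like these let the query engine prune partitions that fall outside a time filter instead of scanning the entire prefix.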
Information for CloudWatch Logs
If you’re using Amazon OpenSearch Service to direct query data in CloudWatch Logs, we also recommend the following:
- When searching multiple log groups in one query, use the appropriate syntax. For more information, see Multi-log group functions.
- When using SQL or PPL commands, enclose certain fields in backticks to successfully query them. Backticks are needed for fields with special characters (non-alphabetic and non-numeric). For example, enclose @message, Operation.Export, and Test::Field in backticks. You don't need to enclose columns with purely alphabetic names in backticks.
Example query with simple fields:
SELECT SessionToken, Operation, StartTime FROM `LogGroup-A` LIMIT 1000;
Similar query with backticks appended:
SELECT `@SessionToken`, `@Operation`, `@StartTime` FROM `LogGroup-A` LIMIT 1000;
Information for Security Lake
If you’re using Amazon OpenSearch Service to direct query data in Security Lake, we also recommend the following:
- Check your Security Lake status and ensure that it's running smoothly without any problems. For detailed troubleshooting steps, see Troubleshooting data lake status in the Amazon Security Lake User Guide.
- Verify your query access:
  - If you're querying Security Lake from a different account than the Security Lake delegated administrator account, set up a subscriber with query access in Security Lake.
  - If you're querying Security Lake from the same account, check for any messages in Security Lake about registering your managed S3 buckets with Lake Formation.
- Explore the query templates and pre-built dashboards to jumpstart your analysis.
- Familiarize yourself with Open Cybersecurity Schema Framework (OCSF) and Security Lake:
  - Review schema mapping examples for AWS sources in the OCSF GitHub repository.
  - Learn how to query Security Lake effectively by visiting Security Lake queries for AWS source version 2 (OCSF 1.1.0).
  - Improve query performance by using partitions: accountid, region, and time_dt.
- Get comfortable with SQL syntax, which Security Lake supports for querying. For more information, see Supported OpenSearch SQL commands and functions.
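To illustrate the partition guidance above, a query can constrain all three partition columns so the engine scans only the matching partitions. The table name and filter values here are hypothetical:

```sql
-- Hypothetical Security Lake table name and filter values. Restricting
-- accountid, region, and time_dt lets the engine prune partitions
-- instead of scanning the full dataset.
SELECT *
FROM amazon_security_lake_table_example
WHERE accountid = '123456789012'
  AND region = 'us-east-1'
  AND time_dt >= TIMESTAMP '2025-06-01 00:00:00'
LIMIT 1000;
```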