Several issues around the flat object type

### Is your feature request related to a problem? Please describe

In #6507, we add the flat object type. In current implementation, we use two stages to process the flat_object field in the document. First we use the `JsonToStringXContentParser` to collect all the keys and values (`keyList`, `valueList` and `valueAndPathList`) in the field and convert to `XContentParser` for return. The Lucene fields are then constructed by parsing the fields in the `XContentParser`.

https://github.com/opensearch-project/OpenSearch/blob/d6bda7dca75dc4d328b61abcd21b35a6bdfcab05/server/src/main/java/org/opensearch/common/xcontent/JsonToStringXContentParser.java#L80-L85

For a field of flat_object type in a document, the following internal fields will be created by default:
- root StringField and SortedSetDocValuesField for each subfield key(prefiexed by root field name);
- `value` StringField and SortedSetDocValuesField for each subfield value;
- `valueAndPath` StringField and SortedSetDocValuesField for each subfield;
- `_field_name` StringField for each `value` and `valueAndPath`.

```
PUT test
{
  "mappings": {
    "properties": {
      "field1": {
        "properties": {
          "field2": {
            "type": "flat_object"
          }
        }
      }
    }
  }
}

PUT test/_bulk
{"index": {}}
{"field1": {"field2": {"a": "1", "b": "2"}}}
```
For example, the request above generates the fields listed below.
![image](https://github.com/user-attachments/assets/7266ec0a-8b5b-4dae-9f49-e1d29e0e5ded)


There are several issues around the flat object field.
1. If a subfield in the flat_object field suffixed by `VALUE_SUFFIX` (`._value`) or `VALUE_AND_PATH_SUFFIX` (`._valueAndPath`),  **some extra unexpected field may be created**.

https://github.com/opensearch-project/OpenSearch/blob/d6bda7dca75dc4d328b61abcd21b35a6bdfcab05/server/src/main/java/org/opensearch/index/mapper/FlatObjectFieldMapper.java#L660-L669

2. We use '=' to concat subfield key and value, if a subfield key contains '=', **the prefix query may return wrong results**. 

3. **The root fields is confusing and unnecessary**. AFAIK, the root field is use to execute exist query and build fielddata, but it doesn't be generated correctly. For example, if we have document `{"field1": {"field2": {"field3": {"a": "1", "b": "2"}}}}`, and `field2` is flat_object field. After processed, the root fields contains values `field1.field2.a`, `field1.field2.b` and `field1.field2.field3`. The exist query of `field1.field2.field3.a` doesn't return correct result. On the other hand, I don't know is there any meaning to aggregate or sort on the subfield keys. In fact, I don't think that we need to support aggs on flat_object field, it's a object, not a scalar value. If we do need, then we should aggregate on the subfield values, not the subfield keys. Of course, it still makes sense to aggregate subfields, we can utilize the valueAndPath field to support this.

4. **Creating `_field_name` field for `value` and `valueAndPath` is meaningless**. The `_field_name` field is used by exist query, we just need to create it for each full leaf path of subfield.

5. **The value of SortedSetDocValuesField of `value` and `valueAndPath` has unnecessary prefix**. When create SortedSetDocValuesField, we use root field name as the prefix of value.

6. **Two-stage processing is unnecessary**. In the process of converting to JSON strings, we use a lot of additional resources, this is really unnecessary, we can add the corresponding field to parse context during the process.

### Describe the solution you'd like

1. Use one-stage processing;
2. Remove the [FlatObjecField](https://github.com/opensearch-project/OpenSearch/blob/d6bda7dca75dc4d328b61abcd21b35a6bdfcab05/server/src/main/java/org/opensearch/index/mapper/FlatObjectFieldMapper.java#L95);
3. Support the aggs on the subfield, but not the root field, which means the fielddata is not supported on the root field but the subfield;
4. For the indices created after 2.18.0, remove the prefix of the value of SortedSetDocValuesField.

In addition, I have no good idea to fix the issue 2, any suggestions about this or the overall issue are welcome.

### Related component

Search

### Describe alternatives you've considered

_No response_

### Additional context

_No response_

	builder.field(this.fieldTypeName, new HashSet<>(keyList));
	builder.field(this.fieldTypeName + VALUE_SUFFIX, new HashSet<>(valueList));
	builder.field(this.fieldTypeName + VALUE_AND_PATH_SUFFIX, new HashSet<>(valueAndPathList));
	builder.endObject();
	String jString = XContentHelper.convertToJson(BytesReference.bytes(builder), false, MediaTypeRegistry.JSON);
	return JsonXContent.jsonXContent.createParser(this.xContentRegistry, this.deprecationHandler, String.valueOf(jString));

	if (valueType.equals(VALUE_SUFFIX)) {
	if (valueFieldMapper != null) {
	valueFieldMapper.addField(context, value);
	}
	}
	if (valueType.equals(VALUE_AND_PATH_SUFFIX)) {
	if (valueAndPathFieldMapper != null) {
	valueAndPathFieldMapper.addField(context, value);
	}
	}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Several issues around the flat object type #16061

Is your feature request related to a problem? Please describe

Describe the solution you'd like

Related component

Describe alternatives you've considered

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Several issues around the flat object type #16061

Description

Is your feature request related to a problem? Please describe

Describe the solution you'd like

Related component

Describe alternatives you've considered

Additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions