Automatic schema discovery assigns an appropriate data type to each column in a dataset. It is especially useful when the data source is a local file.
When a provider uses a local file as its data source and has no manual API_config specifying column data types, every column is treated as a string by default. Strings are used because they can represent any value, which leaves you free to cast the data to preferred types at a later stage.
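The default behavior described above can be illustrated with a minimal Python sketch. It is not the provider's implementation: the sample data, the column names, and the helpers `read_as_strings` and `cast_columns` are hypothetical and exist only to show every value arriving as a string and being cast afterwards.

```python
import csv
import io

# Hypothetical sample standing in for the contents of a local file.
SAMPLE = "id,price,name\n1,9.50,apple\n2,12.00,pear\n"


def read_as_strings(text):
    """Parse delimited text; csv.DictReader yields every value as a str."""
    return list(csv.DictReader(io.StringIO(text)))


def cast_columns(rows, casts):
    """Return new rows with the listed columns cast; other columns stay strings."""
    return [
        {col: casts.get(col, str)(value) for col, value in row.items()}
        for row in rows
    ]


rows = read_as_strings(SAMPLE)          # all values are strings here
typed = cast_columns(rows, {"id": int, "price": float})  # cast later, on demand
```

Because the cast is a separate step, the same string data can be reinterpreted with different types without re-reading the file.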
To enable automatic schema discovery, set two connection string parameters:
2. RowsToScan
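To give a sense of what row-scanning type inference might look like, here is a rough Python sketch. It assumes that RowsToScan limits how many rows are sampled when guessing column types; that reading is inferred from the parameter's name and is not confirmed by the text. The functions `infer_column_type` and `discover_schema`, the type labels, and the sample rows are all hypothetical.

```python
def _parses_as(caster, text):
    """Return True if caster(text) succeeds without a ValueError."""
    try:
        caster(text)
        return True
    except ValueError:
        return False


def infer_column_type(values):
    """Pick the narrowest type label that fits every non-empty sampled value."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        return "string"
    if all(_parses_as(int, v) for v in non_empty):
        return "integer"
    if all(_parses_as(float, v) for v in non_empty):
        return "float"
    return "string"  # fallback that accommodates any value


def discover_schema(rows, rows_to_scan):
    """Infer a type per column from only the first rows_to_scan rows (assumed semantics)."""
    sample = rows[:rows_to_scan]
    columns = list(sample[0].keys()) if sample else []
    return {col: infer_column_type([r[col] for r in sample]) for col in columns}


# Hypothetical rows, already read from a local file as strings.
rows = [
    {"id": "1", "price": "9.5", "name": "apple"},
    {"id": "2", "price": "12", "name": "pear"},
    {"id": "x3", "price": "n/a", "name": "fig"},
]

shallow = discover_schema(rows, 2)  # the third row is never inspected
deep = discover_schema(rows, 3)     # the third row forces string columns
```

In this sketch, scanning only two rows types `id` as an integer and `price` as a float, while scanning all three falls back to strings for both. Under the stated assumption, a larger scan depth trades speed for a lower risk of mistyping a column.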