intel/skylake: nhlt: Add capture config for echo ref stream for Max98373 Codec

During speaker playback, four-channel I/V feedback data is captured
from SSP0 Rx. Of these four channels, the stereo V-sense data must be
provided as the echo reference stream, so add a stereo capture config
to max98373_capture_formats.

TEST='Audio playback and capture of stereo echo ref data'

