Thank you for your inquiry.
Please check the following points.
Separate human voices with HARK. In other words, if the sampling rate is sufficient at 16kHz, then:
1. Although the TSP response file is recorded at 48 kHz, please downsample this file to 16 kHz. Because
16384.little_endian.wav is a TSP file for the 0 to 8 kHz.
2. HARKTOOL creates a transfer function from the TSP response file for the 0 to 8 kHz. In other words, it is not necessary to change the paramter settings for the number of samples by HARKTOOL.
3. In the HARK network file, connect the MultiDownSampler node after the AudioStreamFromMic node, and downsample from 48kHz to 16kHz. And, the LocalizeMUSIC and GHDSS nodes use a transfer function for the 0 to 8 kHz.
Normally, 16 kHz is sufficient to process the human voice band.
If you need to separate up to very high frequency bands like electronic sounds. In that case, do not use 16384.little_endian.wav. You need to recreate the TSP file itself for 48kHz. In other words, it is a TSP file up to 24kHz, which is the Nyquist frequency.
In this case, the 786,432 samples that you wrote in the post are correct as calculations.
If you can read Matlab script, my script may be helpful. In my code, TSP file and inverse TSP file are generated by specifying sampling rate etc. The reason for duplicating the channel of TSP file is to ensure that wios does not fail if the playback device is stereo.
I think my script maybe works with Matlab’s Clone (eg octave) too, but I will attach a 48kHz sample just in case.