Masayuki Takigahira

Forum Replies Created

Viewing 7 posts - 61 through 67 (of 67 total)
  • in reply to: Sound Source Localization with suppression of constant noise #437

    I suspect the cause is as follows.
    IterCount is a node that outputs the frame count.
    In the cookbook, the Constant node is set to 200 frames. One frame corresponds to 10 ms (with 16 kHz sampling and ADVANCE set to 160 samples), so 200 frames correspond to 2 seconds.
    In other words, the input data must be longer than 2 seconds.
    You can either lower the frame count to match the length of the input file,
    or lengthen the input file to match the frame count.
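
    The frame arithmetic above can be checked with a short sketch. The function names are hypothetical and only illustrate the calculation; the defaults assume ADVANCE = 160 samples and 16 kHz sampling as stated above, and the window length is ignored for simplicity.

```python
def frames_to_seconds(frames: int, advance: int = 160, sample_rate: int = 16000) -> float:
    """Time covered by `frames` frames when each frame advances `advance` samples."""
    return frames * advance / sample_rate


def samples_for_frames(frames: int, advance: int = 160) -> int:
    """Approximate number of input samples consumed by `frames` frames.

    Assumption: the analysis window length is ignored, so this is a rough
    lower bound on the required input length, not an exact figure.
    """
    return frames * advance


print(frames_to_seconds(200))   # 2.0 seconds
print(samples_for_frames(200))  # 32000 samples
```

    If your recording is shorter than this, either record more audio or reduce the frame count in the Constant node accordingly.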

    The detailed behavior of CMMakerFromFFTwithFlag is described in the HARK document. The HTML version can be found at the following URL.

    Also, although it does not affect the calculation result, some outdated descriptions may remain in the cookbook manual.
    For example, the format saved by CMSave has been changed to zip, so it is preferable to use zip as the file name extension.

    Best regards,

    in reply to: Sound Source Localization with suppression of constant noise #430

    >> It is the name of my input.wav?

    If type is set to string and a file name is entered directly in the text box, processing is done with that file.
    On the other hand, if you set the type to subnet_param and enter the string ARG1 in the text box, you can specify the file at startup.
    This has the same effect as $1 in a shell script.
    Likewise, when you write ARG2, the value of the 2nd argument given at run time is used.
    This is often used when you want to apply the same HARK processing to various input data.
    The method can be used on many other nodes, but sometimes the type must be indicated explicitly.
    In that case, specify it in the form type:ARGn, such as string:ARG1 or float:ARG3.
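
    To make the ARGn idea concrete, here is a small Python sketch of the substitution semantics described above. It is not HARK's implementation; the function name resolve_arg and the set of supported type names are assumptions made only for illustration.

```python
import re
from typing import Sequence

# Assumed type names for this illustration only.
_CASTS = {"string": str, "int": int, "float": float}


def resolve_arg(spec: str, args: Sequence[str]):
    """Resolve 'ARG1' or 'float:ARG3' against the runtime arguments.

    `args` holds the arguments after the network file name, so ARG1 is args[0],
    in the same way that $1 is the first argument of a shell script.
    """
    match = re.fullmatch(r"(?:(\w+):)?ARG(\d+)", spec)
    if match is None:
        raise ValueError(f"not an ARG specification: {spec!r}")
    type_name = match.group(1) or "string"  # untyped ARGn is treated as a string here
    index = int(match.group(2))
    if type_name not in _CASTS:
        raise ValueError(f"unsupported type in this sketch: {type_name!r}")
    if index < 1 or index > len(args):
        raise IndexError(f"argument {index} was not given")
    return _CASTS[type_name](args[index - 1])


runtime_args = ["./your_input.wav", "output.txt", "0.5"]
print(resolve_arg("ARG1", runtime_args))        # ./your_input.wav
print(resolve_arg("float:ARG3", runtime_args))  # 0.5
```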

    Best regards,

    in reply to: No recognition results are returned by KaldiDecoder (v2.4.0) #428


    OS : Ubuntu 16.04.03
    CPU : Intel Core i7-7700@3.60GHz [Turbo Boost:4.2GHz], 4 cores(8 threads)
    Mem : 64GB


    in reply to: No recognition results are returned by KaldiDecoder (v2.4.0) #424








    in reply to: Sound Source Localization with suppression of constant noise #419

    Please set the VALUE parameter of the Constant node in MAIN as follows.
    – Set type to subnet_param.
    – Enter ARG1 in the text box.
    When you execute the network file in a terminal, you can then give the input WAV file by specifying its file name as the first argument.

    e.g.) ./network.n ./your_input.wav

    Please leave all parameters of the InputStream node in MAIN empty.
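
    If you want to run the same network on several recordings, a small wrapper script is one option. The following is only a sketch: the helper names and the ./recordings directory are hypothetical, and it assumes the network file is executable and takes the WAV file as its first argument, as in the example above.

```python
import subprocess
from pathlib import Path
from typing import List


def build_command(network: str, wav_path: str) -> List[str]:
    """Command line equivalent to: ./network.n ./your_input.wav"""
    return [network, wav_path]


def run_all(network: str, wav_dir: str) -> None:
    """Run the network once for every .wav file found in wav_dir."""
    for wav in sorted(Path(wav_dir).glob("*.wav")):
        # check=True stops the loop if one run exits with an error.
        subprocess.run(build_command(network, str(wav)), check=True)


# Example call (hypothetical directory name):
# run_all("./network.n", "./recordings")
```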

    Best regards,

    in reply to: Thresh parameter in SourceTracker node #418

    SourceTracker’s Thresh is the threshold against which the MUSIC spectrum power
    output by the node connected to the preceding stage, such as LocalizeMUSIC, is compared.
    It is difficult to assign a physical meaning to the value of the MUSIC spectrum itself.
    Therefore, it has no unit.

    For the MUSIC spectrum power when LocalizeMUSIC is connected, please refer to formulas (15) and (16) in the document.

    The appropriate Thresh depends on the user’s environment, so there is no recommended value,
    but you can find a roughly suitable value with the following method.
    1. If you set the DEBUG parameter of the LocalizeMUSIC node to true, the MUSIC spectrum power for each direction included in the transfer function is output to stdout.
    2. In this output, the column whose value rises while you are speaking holds the MUSIC spectrum power in the sound source direction.
    3. Check the average of the values shown during the speaking periods and the average of the values shown during the silent periods.
    4. Use the midpoint between the two averages from step 3 as the initial value of SourceTracker’s Thresh parameter, then fine-tune it.
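
    Steps 3 and 4 amount to the following calculation. This is a minimal sketch: the function name is hypothetical, the numbers are made-up placeholders rather than measured values, and the power values are assumed to have been copied by hand from the DEBUG output.

```python
from statistics import mean
from typing import Sequence


def initial_thresh(speech_powers: Sequence[float], silence_powers: Sequence[float]) -> float:
    """Midpoint between the average power while speaking and while silent."""
    return (mean(speech_powers) + mean(silence_powers)) / 2.0


# Placeholder values for illustration only; use the values from your own environment.
speech = [30.0, 32.0, 34.0]   # column values during speaking periods
silence = [20.0, 22.0, 24.0]  # same column during silent periods
print(initial_thresh(speech, silence))  # 27.0
```

    The result is only a starting point; adjust it up if noise is detected as a source, or down if speech is missed.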

    Best regards,

    in reply to: C++ code of the nodes #413

    – Case.1
    If you are using the Ubuntu distribution, you can easily get the source code with the following command.

    apt-get source <package-name>

    e.g.) To get the source code of the HARK basic packages and of HARK-Python:
    apt-get source harkfd hark-sss hark-python

    – Case.2
    If you need a version older than the current release,
    or if you are using another Linux distribution (e.g. Debian, Mint, Fedora, CentOS, Red Hat, Vine, etc.),
    you can download the source code in tar.gz or tar.xz format from the following location.
    However, we do not officially support distributions other than Ubuntu.
    If you use an OS other than Ubuntu, you may need to change or/and<code-name>/non-free/source/

    e.g.) Xenial, Trusty, Precise are as follows.

    If you need an even older version, please download it from the bottom of this page.

    Best regards,
