NatureLM-Audio / static /help.html
dianekim's picture
Update examples, Help tab, and css (#137)
abda905
<div class="banner">
<div style="display: flex; padding: 0px; align-items: center; flex: 1;">
<div style="font-size: 20px; margin-right: 12px;"></div>
<div style="flex: 1;">
<div class="banner-header">Help us improve the model!</div>
<div class="banner-text">
Found an issue or have suggestions?
Join us on Discourse to share feedback and questions.
</div>
</div>
</div>
<a href="https://earthspeciesproject.discourse.group/t/feedback-for-naturelm-audio-ui-hugging-face-spaces-demo/17"
target="_blank" class="link-btn">Share Feedback</a>
</div>
<div class="guide-section">
<h3>Getting Started</h3>
<ol style="margin-top: 12px; padding-left: 20px;
color: #6b7280; font-size: 14px; line-height: 1.6;">
<li style="margin-bottom: 8px;">
<strong>Upload your audio</strong> or click on a pre-loaded example.
Drag and drop your audio file containing animal vocalizations,
or click on an example.
</li>
<li style="margin-bottom: 8px;">
<strong>Trim your audio (if needed)</strong> by clicking the scissors
icon on the bottom right of the audio panel. Only the first 10 seconds of audio will be analyzed, so trim to the most relevant section of your recording.
</li>
<li style="margin-bottom: 8px;">
<strong>View the Spectrogram (optional)</strong>. You can easily
view/hide the spectrogram of your audio for closer analysis.
</li>
<li style="margin-bottom: 8px;">
<strong>Select a task or write your own</strong>. Select an option
from pre-loaded tasks. This will auto-fill the text box with a prompt,
so all you have to do is hit Send. Or, type a custom prompt directly
into the chat.
</li>
<li style="margin-bottom: 0;">
<strong>Send and Analyze Audio</strong>. Press "Send" or type Enter
to begin processing your audio. Ask follow-up questions or press
"Clear" to start a new conversation.
</li>
</ol>
</div>
<div class="guide-section">
<h3>Tips</h3>
<b>Prompting Tips</b> (see full <a href="https://projects.earthspecies.org/naturelm-audio/prompting_guide.html" target="_blank">Prompting Guide</a> for more)
<ul style="margin-top: 12px; padding-left: 20px;
color: #6b7280; font-size: 14px; line-height: 1.6;">
<li><strong>For Yes/No questions, always include "Answer: Yes or No."</strong> Without this, the model may respond with species names rather than a yes or no answer.</li>
<ul>
<li>
<i>Is an alarm call present in this recording? Answer: Yes or No.</i>
</li>
<li>
<i>Is there a frog or amphibian vocalizing in this recording? Answer: Yes or No.</i>
</li>
</ul>
<li><strong>Providing geographic or temporal context</strong> can help narrow identification.</li>
<ul>
<li>
<i>Given the context: '[context]', what is the common name for the focal species in the audio?"</i>
</li>
<li>
Replace [context] with whatever metadata you have, e.g. country: BR, coordinates: -23.5, -46.6 or recorded in temperate forest, June.
</li>
</ul>
<ul>
</ul>
<li><strong>Giving the model a candidate list</strong> to choose from can improve accuracy. </li>
<ul>
<li>
<i>Which of these is the focal species in the audio? Options: [species_choices]</i>
</li><li>
<i>Replace [species_choices] with a comma-separated list, e.g. Turdus merula, Erithacus rubecula, Fringilla coelebs, Parus major, Phylloscopus collybita.</i>
</li>
</ul>
</ul>
<br>
<b>Audio Files</b>
<ul style="margin-top: 12px; padding-left: 20px;
color: #6b7280; font-size: 14px; line-height: 1.6;">
<li>Supported formats: .wav, .mp3, .aac, .flac, .ogg, .webm,
.midi, .aiff, .wma, .opus, .amr</li>
<li>If you are uploading an .mp4, please check that it is not
an MPEG-4 Movie file.</li>
<li>For best results, use high-quality recordings with minimal
background noise.</li>
</ul>
</div>
<div class="guide-section">
<h3>Learn More</h3>
<ul style="margin-top: 12px; padding-left: 20px;
color: #6b7280; font-size: 14px; line-height: 1.6;">
<li>Visit the <a href="https://projects.earthspecies.org/naturelm-audio/prompting_guide.html"
target="_blank">NatureLM-audio Project Page</a>
for more details, examples, and the full Prompting Guide</li>
<li>Read our <a href="https://huggingface.co/blog/EarthSpeciesProject/nature-lm-audio-ui-demo/"
target="_blank">blog post</a> with a step-by-step tutorial</li>
<li>Check out the
<a href="https://arxiv.org/abs/2411.07186"
target="_blank">published paper</a> for a deeper technical dive on NatureLM-audio</li>
<li>Sign up for our <a href="https://forms.gle/WjrbmFhKkzmEgwvY7"
target="_blank">closed beta waitlist</a>, if you're interested in testing upcoming features like longer audio files and batch processing.</li>
</ul>
</div>