{"id":827,"date":"2018-04-12T14:50:37","date_gmt":"2018-04-12T13:50:37","guid":{"rendered":"https:\/\/blog.freesound.org\/?p=827"},"modified":"2018-04-12T14:50:37","modified_gmt":"2018-04-12T13:50:37","slug":"introducing-freesound-datasets-and-more","status":"publish","type":"post","link":"https:\/\/blog.freesound.org\/?p=827","title":{"rendered":"Introducing Freesound Datasets (and more!)"},"content":{"rendered":"<p>Dear Freesounders,<\/p>\n<p>Today we are very happy to introduce you to <strong>Freesound Datasets<\/strong>, a new platform that we&#8217;ve been developing during the last year to foster the re-use of Freesound content in research contexts and that will eventually help us make Freesound better and better. Curious? Check out the website at <a href=\"https:\/\/datasets.freesound.org\/\" target=\"_blank\" rel=\"noopener\">https:\/\/datasets.freesound.org\/<\/a>.<\/p>\n<p><a href=\"https:\/\/blog.freesound.org\/wp-content\/uploads\/2018\/04\/ad029733-1371-8e19-f1cc-95ae837669ee.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-828\" src=\"https:\/\/blog.freesound.org\/wp-content\/uploads\/2018\/04\/ad029733-1371-8e19-f1cc-95ae837669ee.png\" alt=\"\" width=\"619\" height=\"132\" \/><\/a><\/p>\n<p><em>But what exactly is a dataset?<\/em> To say it short, a dataset is a collection of items (sounds) annotated with labels chosen from a limited vocabulary of concepts. Well-curated datasets are one of the most important things that are needed to advance research in many fields, including sound and music related research.<\/p>\n<p>Freesound Datasets is a platform that allows users to explore the contents of datasets made with Freesound sounds. But even more importantly, Freesound Datasets allows anyone to <strong>help make the datasets better by providing new annotations<\/strong>. Furthermore, it also promotes discussion about the datasets that it hosts, and allows (or better said, <em>will allow<\/em>) anyone to download different timestamped versions of them. If you&#8217;d like a more <em>academic<\/em> description about the platform, you can check out this paper we presented at the?<em>International Society for Music Information Retrieval Conference<\/em> last year: <a href=\"http:\/\/mtg.upf.edu\/node\/3827\">Freesound Datasets: A Platform for the Creation of Open Audio Datasets<\/a>.<\/p>\n<p>Using Freesound Datasets, we already started creating a first dataset which we called <strong>FSD<\/strong>. FSD is a big, general-purpose dataset composed of Freesound content and annotated with labels from Google?s <a href=\"https:\/\/research.google.com\/audioset\/ontology\/index.html\" target=\"_blank\" rel=\"noopener\">AudioSet Ontology<\/a> (a vocabulary of more than 600 <em>sound classes<\/em>). Currently, <strong>FSD is still much smaller than what we would like<\/strong>, but we are sure with the help of people all around the world it will get bigger and bigger. Needless to say, you are <strong>more than welcome to contribute<\/strong> to it (or in other words, <em> please contribute!<\/em>). All you need to do is visit the Freesound Datasets website and click on <em>Get started with our annotation tasks!<\/em> We will simply ask you to listen to some sounds and have fun \ud83d\ude42 You&#8217;ll see an interface like this (you can login with your Freesound credentials):<\/p>\n<p><a href=\"https:\/\/blog.freesound.org\/wp-content\/uploads\/2018\/04\/Annotation-screenshot.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-834\" src=\"https:\/\/blog.freesound.org\/wp-content\/uploads\/2018\/04\/Annotation-screenshot.png\" alt=\"\" width=\"1128\" height=\"790\" \/><\/a><\/p>\n<p>That&#8217;s really cool, isn&#8217;t it!?<\/p>\n<h2 style=\"text-align: center;\"><a href=\"https:\/\/datasets.freesound.org\/fsd\/annotate\/\">Yeah that&#8217;s awesome, take me to this interface because I can&#8217;t wait any longer to start annotating!<\/a><\/h2>\n<p>But you know what? <strong>There is even more! <\/strong>We have been awarded a <a href=\"https:\/\/research.googleblog.com\/2018\/03\/google-faculty-research-awards-2017.html\"> Google Faculty Research Award<\/a>?to support the development of Freesound Datasets and FSD, and, in relation to that, have started a collaboration with some colleagues from <a href=\"https:\/\/research.google.com\/teams\/perception\/\">Google&#8217;s Machine Perception Team<\/a> to do research on <a href=\"https:\/\/en.wikipedia.org\/wiki\/Computer_audition\">machine listening<\/a>. As the first outcome of this collaboration, we recently launched a competition in <a href=\"https:\/\/www.kaggle.com\/\">Kaggle<\/a>? (see <a href=\"https:\/\/www.kaggle.com\/c\/freesound-audio-tagging\"> Freesound General-Purpose Audio Tagging Challenge<\/a>), in which participants are challenged to build artificial intelligence algorithms <span class=\"s2\">able to recognize 41 diverse categories of everyday sounds. <\/span>The dataset used for this competition is a small subset of FSD.<\/p>\n<p>The great great great thing is that the <strong>outcomes of all these research efforts will help us improve Freesound in many ways<\/strong>. By training our search engine with FSD, we would, for example, be able to find search results <em>inside<\/em> sounds (for example, a fragment of a field recording with bird chirps), or be able to allow you to browse Freesound sounds using a hierarchical structure. This, and many other things that we will find out in the future \ud83d\ude42<\/p>\n<p>That&#8217;s it for now, thanks for reading&#8230;<\/p>\n<p>the Freesound Team<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Dear Freesounders, Today we are very happy to introduce you to Freesound Datasets, a new platform that we&#8217;ve been developing during the last year to foster the re-use of Freesound content in research contexts and that will eventually help us &hellip; <a href=\"https:\/\/blog.freesound.org\/?p=827\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":8,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-827","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/blog.freesound.org\/index.php?rest_route=\/wp\/v2\/posts\/827","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.freesound.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.freesound.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.freesound.org\/index.php?rest_route=\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.freesound.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=827"}],"version-history":[{"count":28,"href":"https:\/\/blog.freesound.org\/index.php?rest_route=\/wp\/v2\/posts\/827\/revisions"}],"predecessor-version":[{"id":857,"href":"https:\/\/blog.freesound.org\/index.php?rest_route=\/wp\/v2\/posts\/827\/revisions\/857"}],"wp:attachment":[{"href":"https:\/\/blog.freesound.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=827"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.freesound.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=827"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.freesound.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=827"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}