Allowlist for endpoint restriction
0.9.13
Project and dataset secrets now support an allowlist for endpoint restriction.
Stay up to date with the latest features and improvements of the Mabyduck subjective testing platform.
0.9.13
Project and dataset secrets now support an allowlist for endpoint restriction.
0.9.12
This release comes with a variety of improvements.
0.9.11
Organisations can now use Okta OIDC for authentication.
Pairwise video experiments also now have a sequential video player as an option.
0.9.10
This release brings new session and rater health scores.
We've also improved the robustness of both AI raters and pairwise video experiments.
0.9.9
You can now set up project-level secrets.
We've also made it possible to filter plots by parameters, and embedded experiments can now store custom metadata on slates.
0.9.8
Audio surveys now support a new "highlight transcript" question type, allowing raters to highlight words in a transcript. Survey experiments also gain a new "Percent chosen" metric for radio buttons and checkboxes, with new plots of these metrics on the results pages.
Custom strategies can now also be fetched via the API.
0.9.7
Active sampling strategies have been improved, with fixes to the initial phase of sampling unobserved conditions.
We've also enabled Stripe payments for organisation billing.
0.9.6
This release comes with a variety of improvements.
0.9.5
Jobs now have a new "pending" status, and will automatically abort when an experiment is deleted. We've also added automatic checks for fragmentation of externally hosted mp4s, and disabled the ability to force-launch jobs via the API.
This release also brings rater performance leaderboards, and a handful of other API improvements.
0.9.4
We've updated API keys to be associated with users
0.9.3
You can now set up project webhooks for dataset status changes, job completion, and session completion, with webhook and secret management available in both the API and admin.
We've also improved the reliability of embedded experiments including clearer API error reporting, wider time estimate support, automatic retrying of failed media parsing, and config files now filtered out of uniform strategy stimuli.
0.9.2
Hero videos can now be set to take over the full screen. They are also now configurable directly via the UI.
0.9.1
Pairwise video experiments can now include arbitrary checkboxes.
We've also added Elo support for MUSHRA experiments.
0.9.0
Experiments can now include a custom hero image or video via hero_media_url, along with an optional hero_caption. Experiment API responses also now return hero_media_url in all cases, with a null value when not set.
We've also increased the character limit for experiment introduction text.
0.8.9
Pairwise video experiments with a discrete or continuous response type now support a "tie" option.
0.8.8
A few more updates in this release
0.8.7
This release comes with a variety of improvements.
0.8.6
We've introduced a method that reinterprets rankings as pairwise comparisons, improving the efficiency of the Plackett-Luce metric.
We've also replaced rating buttons with sliders in pairwise video experiments, and ACR image experiments now support continuous scores and sliders too.
Also fixed a hover state issue in bar plots.
0.8.5
Added new response types and support for multiple dimensions in pairwise video experiments.
It is now possible to create datasets, experiments, and jobs in a single API request. Custom strategies are also now supported.
This release added S3 storage support for customer data alongside a new Organisation model. We also fixed an issue with large file uploads in the browser, which now use multipart uploads.
0.8.4
Added a "scale to fit" option for ACR image experiments.
0.8.3
It is now possible to use self-hosted datasets by providing a list of URLs, instead of uploading media files directly to us.
We also improved our API, and it is now possible to launch jobs where previously it was only possible to configure drafts via the API.
0.8.1
Our embedded experiments are now widely available. This type of experiment uses JavaScript to include arbitrary content in experiments, and is ideal for running interactive studies.
0.7.9
We released new types of experiments that allow the configuration of arbitrary surveys below images, audio, or video.
0.7.8
We improved our support for datasets with very large numbers of conditions. This is useful, for example, when you want to collect labels for training and need to label a large number of audio, images, or videos that are not AI-generated.
0.7.7
Selection strategies have received more configuration options. For example, it is now possible to evaluate only a subset of a dataset. It is also possible to always include one method in pairwise comparisons against other methods.
This release also makes it possible to scale (instead of cropping) images in pairwise image experiments.
0.7.5
We added the ability to add configurable plots to rubrics and leaderboards.
It is now also possible to create draft experiments and jobs via our API.
0.7.4
Today, we are opening up Mabyduck to everyone.
0.7.3
We made small tweaks to our design and changes to our backend to prepare for a public launch.
0.7.2
This release contained several improvements:
0.7.1
We added optional confidence regions to line graph visualizations of your results.
This release also adds support for references in ACR audio experiments.
0.7.0
This release comes with a variety of improvements.
0.6.3
We improved the handling of very large dataset uploads through the browser. If a dataset upload is interrupted for any reason, it is now possible to resume uploads.
This version also adds a new Markdown input field for writing introductions.
0.6.2
We introduced the ability for raters to leave feedback on individual slates and alert us to any potential issues with an experiment.
This version also updated the leaderboards' design.
0.2.6
We internationalized our experiments. In addition to English, we now support French and German.
Additionally, different experiments can now use different config files. This allows you to upload a single dataset with multiple config files for different experiments.
0.2.5
Pairwise image experiments now support references. We also introduced new configuration options for the MUSHRA experiment.
0.2.2
We introduced a new pairwise image experiment. We also added a way to preview images in datasets.
0.2.0
We have implemented our own pre-screening protocols. This allows us to provide you with a higher quality of raters whose ability and hardware enable them to detect fine differences between stimuli.
0.1.6
It is now possible to launch experiments to crowd-sourced raters through our platform.
0.1.5
We added configuration options to change how waveforms are rendered in MUSHRA experiments. In particular, it is now possible to only render the waveform of the reference so that raters can not draw conclusions based on the waveform.
0.1.4
Release 0.1.4 is packed with new features:
0.1.3
We addressed some minor bugs in the MUSHRA experiment.
0.1.2
Datasets now support config files. These can be used to change the interface for each slate. For example, to display text prompts next to stimuli.
0.1.0
Today, we are excited to release a private beta version of Mabyduck to our design partners.