Note: Data management requires access to Dojo (only available to current customers). Miso Answers engine can work with a variety of data.Documentation Index
Fetch the complete documentation index at: https://docs.miso.ai/llms.txt
Use this file to discover all available pages before exploring further.
- Your content catalog. This includes content that’s consumed by your readers, such as a news article or opinion piece. This is the data the Answers engine will use to answer user questions. As a best practice, teams usually send us the full catalog and then filter out what should be excluded from Answers. This helps us avoid uncertainties about what has or hasn’t been uploaded to Miso. You can continue to upload/delete data at any time; Miso does not charge for indexing content.
- You can also send transcripts for videos and audio files.
- (Optional) Databases and lists. This is useful when you may want the system to extract specific data values rather than combing through the content archive.
- e.g. profile pages about an investment fund, design agencies, supermarkets, hospitals, etc.
- e.g. ranked lists such as Top 50 or award winners
Data Upload Options
You have two options for providing data:- Wordpress integration
- Upload via API
- Send data through SFTP
Integrating With Miso’s Data APIs
Miso’s Data APIs let you automatically upload and manage your data in Miso. These APIs all support high-throughput data ingestion through bulk insert and satisfy GDPR and CCPA compliance by letting users delete their data from Miso.Uploading and Managing Your Product Catalog Data
Miso’s Product APIs let you upload, read, and delete Product records that represent your site’s content.API Throughput
We recommend batching up your Product Upload API calls and sending around 100 records at a time to avoid timeout and memory risks.Product Data Model Design
To fully optimize your Answers engine (and increase our ability to accommodate finetuning requirements), it is important to provide Miso with Product records that are complete and accurate. We define a set of common attributes that capture the basics of most content media products, such astitle, description, categories, tags, authors, etc. You can also use custom_attributes to specify any additional information from your catalog. Miso can handle hundreds of custom attributes, so don’t feel reluctant to provide as much catalog metadata as you have.
How to Use Custom Attributes
If your products’ characteristics cannot be fully captured by the fields that Miso defines, you can also specifycustom_attributes. The more complete the product information is, the smarter Miso becomes. For example, if your product summaries support multiple languages, you can have something like: custom_attributes.alternative_langs: ["en", "zh"].
You might also want to consider including attributes that are required for presenting summaries in the front end (e.g. cover image, raw rating scores etc.) so that you don’t need additional requests to fetch those fields.
Sometimes the design of custom attributes can require some thought. Your Miso solutions architects are available to help you think through the engineering implications. For example, if you have products that are only available in certain regions, how should you represent that? There are a few options:
- Have a list like:
{"regional_availability": ["region_1", "region_3", ...]}that contains all the regions that currently have the product - Have multiple attributes like:
{"region_1_availability": true, "region_2_availability": false, ....}, where each attribute represents the availability of a particular region.
Viewing Data
In Dojo, you can see the data that you’ve uploaded to Miso in a visual dashboard. See the Dojo Data Sets guide for more details. You can also read your Product and User data via API.Deleting Data
Dojo provides a way to delete data records and even wipe all the data in your environment. See the Dojo Data Sets guide for more details. Programmatically, data can be deleted via API as well. Here is an example of how to delete data in bulk: Get all product ids:Troubleshooting
422 Errors (Schema Validation)
When a data upload fails, it is usually due to a schema (formatting) error. Any schema error will cause the whole request to fail: the API will returnstatus_code=422, and none of the records will be inserted. You should check the data field in the response to see where the errors are located. For example, the response below means there are schema errors in the interaction record at index 0: