Introduction
notdiamond-0001 automatically determines whether to send queries to GPT-3.5 or GPT-4, depending on which model is best-suited for your task. We've trained notdiamond-0001 on hundreds of thousands of data points from robust, cross-domain evaluation benchmarks to maximize accuracy while drastically reducing costs and latency. When you call the notdiamond-0001 endpoint, it will return a label for either GPT-3.5 or GPT-4. You determine which version of each model you want to use and make the calls client-side with your own keys.
notdiamond-0001 also offers outage rerouting to a model of your choice in the case that OpenAI's servers are down.
Unlike deterministic routers, you'll notice that notdiamond-0001 doesn't route based on simple categories or domains. Instead, routing decisions are far more fine-grained. To get you started, here are some examples of prompts that get routed to either GPT-3.5 or GPT-4:
Sends to GPT-3.5 | Sends to GPT-4 |
---|---|
What is prostate cancer? | Can you help me understand whether this prostate pathology report could indicate the presence of cancer? CLINICAL DATA: A-H: ELEVATED PROSTATE, PROSTATE [A] LEFT BASE… [continues] |
What are common causes of a 401 error? | This function is throwing a 401 error when it's being called. Do you see anything that could be contributing to that? def get_user_data(request, response, db: Session = Depends(get_db))… [continues] |
What does this paragraph mean?: “As your perspective of the world increases not only is the pain it inflicts on you less but also its meaning… [continues] | Please help me complete this paragraph with a pared down description in the style of Karl Ove Knausgård: The broken cup lay on the table. |
Updated about 1 month ago