Introduction

notdiamond-0001 automatically determines whether to send each query to GPT-3.5 or GPT-4, depending on which model is best suited to your task. We've trained notdiamond-0001 on hundreds of thousands of data points from robust, cross-domain evaluation benchmarks to maximize accuracy while drastically reducing costs and latency. When you call the notdiamond-0001 endpoint, it returns a label for either GPT-3.5 or GPT-4. You choose which version of each model to use and make the calls client-side with your own keys.
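The client-side flow described above (receive a label, map it to the model version you picked, then make the call yourself) can be sketched roughly as follows. This is a minimal illustration, not the official client: the label strings `"gpt-3.5"` and `"gpt-4"`, the `MODEL_VERSIONS` mapping, and the `get_label` and `call_model` callables are all assumptions standing in for the real endpoint response and your own OpenAI call.

```python
from typing import Callable, Dict

# Hypothetical label strings mapped to the model versions you choose to run.
# The actual label values returned by the endpoint may differ.
MODEL_VERSIONS: Dict[str, str] = {
    "gpt-3.5": "gpt-3.5-turbo",
    "gpt-4": "gpt-4",
}


def resolve_model(label: str, versions: Dict[str, str] = MODEL_VERSIONS) -> str:
    """Map a routing label to the concrete model version you selected."""
    if label not in versions:
        raise ValueError(f"Unexpected routing label: {label!r}")
    return versions[label]


def route_and_call(
    prompt: str,
    get_label: Callable[[str], str],
    call_model: Callable[[str, str], str],
) -> str:
    """Ask the router for a label, then call the chosen model client-side.

    get_label stands in for the request to the routing endpoint;
    call_model stands in for your own call made with your own keys.
    """
    label = get_label(prompt)
    model = resolve_model(label)
    return call_model(model, prompt)
```

Passing the two callables in keeps the routing decision separate from the model call, so you can change model versions by editing the mapping alone.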

notdiamond-0001 also offers outage rerouting to a model of your choice if OpenAI's servers are down.
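To illustrate the general idea of rerouting during an outage, here is a minimal client-side sketch. It does not show how notdiamond-0001 implements or configures rerouting, which this page does not specify. The `primary` and `fallback` callables and the choice of exception types that count as an outage are assumptions made for the example.

```python
from typing import Callable, Tuple, Type


def call_with_fallback(
    prompt: str,
    primary: Callable[[str], str],
    fallback: Callable[[str], str],
    outage_errors: Tuple[Type[BaseException], ...] = (ConnectionError, TimeoutError),
) -> str:
    """Try the primary model; if it fails with an outage-like error, use the fallback.

    Which exceptions signal an outage is an assumption here; adapt the tuple
    to the errors your HTTP or SDK client actually raises.
    """
    try:
        return primary(prompt)
    except outage_errors:
        return fallback(prompt)
```

Errors outside `outage_errors` still propagate, so ordinary bugs are not silently masked by the fallback.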

Unlike deterministic routers, notdiamond-0001 doesn't route based on simple categories or domains. Its routing decisions are far more fine-grained: two prompts on the same topic can go to different models. To get you started, here are some examples of prompts that get routed to GPT-3.5 or GPT-4:

| Sends to GPT-3.5 | Sends to GPT-4 |
| --- | --- |
| What is prostate cancer? | Can you help me understand whether this prostate pathology report could indicate the presence of cancer? CLINICAL DATA: A-H: ELEVATED PROSTATE, PROSTATE [A] LEFT BASE… [continues] |
| What are common causes of a 401 error? | This function is throwing a 401 error when it's being called. Do you see anything that could be contributing to that? def get_user_data(request, response, db: Session = Depends(get_db))… [continues] |
| What does this paragraph mean?: “As your perspective of the world increases not only is the pain it inflicts on you less but also its meaning… [continues] | Please help me complete this paragraph with a pared down description in the style of Karl Ove Knausgård: The broken cup lay on the table. |