AI Model Testing Framework – Draft

This doc helps engineers assess LLMs and troubleshoot unexpected results.