Language Tutor
The idea from Heather is that you could have a language tutor that could interactively coach a student to continually improve their language skills. While, certainly, this is not a new idea, it is related to Heatherβs area of expertise, i.e. how to effectively communicate concepts to students & teachers, how to get them to learn & retain information.
Product Ideas
- It should be multimodal, using pictures, audio, text, & video to iteratively test & assist the learner.
- It should know which learner it is dealing with so that it continually tailors its training.
- It should be engaging, e.g. maybe it could teach you about anything that you want to know about while it also teaches you fluency in some language. Of course, this instruction should also be multimodal.
- It should be able to target specific skill areas for more rapid development, i.e. if the student really wants to learn how to order at restaurants as soon as possible, then the instruction should attempt to get the student there as quickly as possible.
- It should be able have a particular conversation with the student, e.g. Teacher, Iβd like to talk about classical architecture with you in Spanish. Then the teacher would start a conversation with the student, continuing where that particular student left off last time or starting from the beginning if need be. Where the student struggled, the teacher would assist & offer repetition later in the conversation or in other unrelated conversations where applicable.
- It should be able to take in a video or image & teach the observer how to describe & discuss it.
Product Questions
What should its name be? Teacher, β¦ Β What device would this teacher work through? iPad, laptop, phone, echo show type device with all mediums possible, chromebook, β¦ Β Is the original intention for use in the classroom? Which medium is the best one for the classroom? Β How could we obtain customers? What do the potential customers want? Β What is the simplest product that has demonstrable value? Is it audio questioning with feedback on the response?
Development Questions
What are the building blocks?
ββββββββββββββββ
β Transcribe β Translate
ββββββββββββββββ Text
Translate ββββββββββββββββ
Audio β Topical β
β Conversation β
ββββββββββββββββ
Conversation in a different language. Conversation with a different historical figure. Β Which order should things be built in? Β What is the simplest product we could get working quickly? Β How could we encode & compress the history of each student in order to iteratively build on that history toward mastery? If a 30 minute conversation in a new language is ~5k words & we can fit 1M words into context, then we can build up to 100 hours of context into model prompt. Itβs likely that we can keep only the last X hours of conversation as context. This implies tradeoffs between perfect previous information & speed, cost, etcβ¦ Β How fast can a local model be? How small can a local model be? Local models really change the cost game. Is it worth trying to become an expert in what Ollama is doing in order to be able to customize everything to run locally where possible?
What is the online chat bot competition called? What can I learn from it?
Transcribe
Weβll need to be able to compare what was said against the correct word or phrase. Will we be able to use probabilities from the model to say how excellent the pronunciation was? Transcription is one way to do this, get a transcription, keep track of the level of confidence & which words are less certain, & feed the transcription to another model that can score it against the target & offer feedback.
Japanese
Start with Japanese. How would you, as an agent teach someone Japanese if they knew nothing about it? Could you lead them in an immersive tale that required them to use language tasks to complete?
Audio
What size are audio samples, i.e. can they be sent quickly & cheaply to & from APIs?
Are audio models small enough to run on user devices?
Deploy models to AWS using the SageMaker SDK.