- Responsible for an analytical model implementation of LLM inference and training memory usage
- Responsible for running the performance simulation to extract the workload's characteristics such as memory footprint and bandwidth requirement.
- Responsible for evaluation ideas for performance improvement
Knowledge in one or more of the following areas, computer architecture , performance modeling, and analytical model
- Knowledge and experience with common LLM (Large Language Model) workloads.
- Proficiency in C or C++, and scripting languages such as Python.
- Experience with high-level simulators for performance or power estimation is a plus.
- Knowledge in server-class GPU/ML architecture is a plus.
Monthly based
Puli Township, Taiwan, Taiwan
Puli Township, Taiwan, Taiwan