About the Project We're building a large-scale evaluation benchmark for advanced AI reasoning across scientific and engineering domains. Task designers create challenging computational problems that test whether AI systems can use real scientific software tools to solve…
Remote job listings
Find your next remote opportunity from thousands of listings across the globe.
Filters
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
›
Country-wide jobs always show. Picking a state also reveals jobs limited to that state.
Annual salary floor. Hourly-only roles pass through.
- Remote (Global)
- $70 – $100/hr
- Jun 9, 2026
About the Project We're building a large-scale evaluation benchmark for advanced AI reasoning across scientific and engineering domains. Task designers create challenging computational problems that test whether AI systems can use real scientific software tools to solve…
- Remote (Global)
- $70 – $100/hr
- Jun 9, 2026
About the Project We're building a large-scale evaluation benchmark for advanced AI reasoning across scientific and engineering domains. Task designers create challenging computational problems that test whether AI systems can use real scientific software tools to solve…
- Remote (Global)
- $70 – $100/hr
- Jun 9, 2026
About the Project We're building a large-scale evaluation benchmark for advanced AI reasoning across scientific and engineering domains. Task designers create challenging computational problems that test whether AI systems can use real scientific software tools to solve…
- Remote (Global)
- $70 – $100/hr
- Jun 9, 2026
About the Project We're building a large-scale evaluation benchmark for advanced AI reasoning across scientific and engineering domains. Task designers create challenging computational problems that test whether AI systems can use real scientific software tools to solve…
- Remote (Global)
- $70 – $100/hr
- Jun 9, 2026
About the Project We're building a large-scale evaluation benchmark for advanced AI reasoning across scientific and engineering domains. Task designers create challenging computational problems that test whether AI systems can use real scientific software tools to solve…
- Remote (Global)
- $70 – $100/hr
- Jun 9, 2026
About the Project We're building a large-scale evaluation benchmark for advanced AI reasoning across scientific and engineering domains. Task designers create challenging computational problems that test whether AI systems can use real scientific software tools to solve…
- Remote (Global)
- $70 – $100/hr
- Jun 9, 2026
About the Project We're building a large-scale evaluation benchmark for advanced AI reasoning across scientific and engineering domains. Task designers create challenging computational problems that test whether AI systems can use real scientific software tools to solve…
- United States
- $40/hr
- May 16, 2026
Centific is hiring a Research Intern focused on Multimodal LLM Benchmarking to contribute to advanced AI evaluation research involving multimodal foundation models. This internship focuses on designing, executing, and analyzing benchmark systems for AI models operating across:…
No jobs available
Check back soon for new remote job opportunities.
Tips for finding remote jobs
- Set up job alerts on multiple platforms to never miss an opportunity.
- Highlight your remote work experience and self-management skills.
- Prepare for video interviews and remote work assessments.
- Customize your resume and cover letter for each remote position.
- Build a strong online presence on LinkedIn and professional networks.