Edge AI Development Engineer/Researcher
Aleron
United States, Texas, Austin
Jun 05, 2025
Description
The AI team enables state-of-the-art ML and DL model development across our hardware portfolio, using sophisticated model compression techniques to deploy previously impractical AI tasks to battery-powered environments. Our team of data scientists research neural architectures best suited to our customer's needs, select those models most amenable to deployment on our platform, and train them carefully tuning for memory, compute, and energy constraint tradeoffs. Finally, we publish our findings and socialize them via conferences, workshops, and publications.
Beyond a healthy obsession with computational efficiency, the successful candidate will be comfortable with operating in a "version zero' environment, marshaling internal, open source, and third-party resources to solve our customer's problems quickly and elegantly.
Requirements
Identify, refine, and/or develop sophisticated ML and DL models for deployment on highly constrained environments.
Train models using SOTA compression techniques to fit in specific memory, compute, and power envelopes, making trade-offs between compression and accuracy.
Publish and maintain these models in a Model Zoo, including, documentation, and other assets needed by our customers to bootstrap their internal AI features.
Socialize their achievements via conferences, meetups, workshops, and publications.
Job Requirements
Required Skills / Qualifications:
Master's Degree
Minimum 2 years experience pruning, distillation, quantization approaches for CNNs and RNNs
Minimum 2 years experience with TensorFlow and Pytorch
Minimum 1 year experience with embedded systems
Preferred Skills / Qualifications:
PhD
Experience with one or more of the following AI task domains: audio classification, speech, vision, and/or time series tasks, including domain-specific feature extraction related to those tasks
Tensorflow (TFLite, TFLite for Microcontrollers, and/or PyTorch are a plus)
Dataset creation and curation
Bonus Qualifications
Past "TinyML" involvement or experience
Experience developing and optimizing for TFLite for Microcontrollers
Experience with embedded C/C++ environments
Experience with compression of attention-based architectures
Experience with model-to-binary compilers (IREE, MicroTVM, etc)
Experience with ONNX, TOSA, Jax, LLVM, and/or MLIR
Experience with optimizing for heterogenous AI compute (e.g. CPU+NPU+DSP)
Aleron companies (Acara Solutions, Aleron Shared Resources, Broadleaf Results, Lume Strategies, TalentRise, Viaduct) are an Equal Opportunity Employer. Race/Color/Gender/Religion/National Origin/Disability/Veteran.
Applicants for this position must be legally authorized to work in the United States. This position does not meet the employment requirements for individuals with F-1 OPT STEM work authorization status.
Apply
Aleron
United States, Texas, Austin
Jun 05, 2025
Description
The AI team enables state-of-the-art ML and DL model development across our hardware portfolio, using sophisticated model compression techniques to deploy previously impractical AI tasks to battery-powered environments. Our team of data scientists research neural architectures best suited to our customer's needs, select those models most amenable to deployment on our platform, and train them carefully tuning for memory, compute, and energy constraint tradeoffs. Finally, we publish our findings and socialize them via conferences, workshops, and publications.
Beyond a healthy obsession with computational efficiency, the successful candidate will be comfortable with operating in a "version zero' environment, marshaling internal, open source, and third-party resources to solve our customer's problems quickly and elegantly.
Requirements
Identify, refine, and/or develop sophisticated ML and DL models for deployment on highly constrained environments.
Train models using SOTA compression techniques to fit in specific memory, compute, and power envelopes, making trade-offs between compression and accuracy.
Publish and maintain these models in a Model Zoo, including, documentation, and other assets needed by our customers to bootstrap their internal AI features.
Socialize their achievements via conferences, meetups, workshops, and publications.
Job Requirements
Required Skills / Qualifications:
Master's Degree
Minimum 2 years experience pruning, distillation, quantization approaches for CNNs and RNNs
Minimum 2 years experience with TensorFlow and Pytorch
Minimum 1 year experience with embedded systems
Preferred Skills / Qualifications:
PhD
Experience with one or more of the following AI task domains: audio classification, speech, vision, and/or time series tasks, including domain-specific feature extraction related to those tasks
Tensorflow (TFLite, TFLite for Microcontrollers, and/or PyTorch are a plus)
Dataset creation and curation
Bonus Qualifications
Past "TinyML" involvement or experience
Experience developing and optimizing for TFLite for Microcontrollers
Experience with embedded C/C++ environments
Experience with compression of attention-based architectures
Experience with model-to-binary compilers (IREE, MicroTVM, etc)
Experience with ONNX, TOSA, Jax, LLVM, and/or MLIR
Experience with optimizing for heterogenous AI compute (e.g. CPU+NPU+DSP)
Aleron companies (Acara Solutions, Aleron Shared Resources, Broadleaf Results, Lume Strategies, TalentRise, Viaduct) are an Equal Opportunity Employer. Race/Color/Gender/Religion/National Origin/Disability/Veteran.
Applicants for this position must be legally authorized to work in the United States. This position does not meet the employment requirements for individuals with F-1 OPT STEM work authorization status.
Apply
Job ID: 480271717
Originally Posted on: 6/7/2025
Want to find more Chemistry opportunities?
Check out the 17,737 verified Chemistry jobs on iHireChemists
Similar Jobs