The person will be Responsible for failure analysis on Customer returned Server/ GPU boards. Hence the person should be able to understand the HW/System design properly before any debug step can be taken.
Roles and Responsibilities:-
- Data Analysis/Communication/Issue Resolution – Prevention
- Have ability to perform system and board level testing and debugging down to components level.
- Have knowledge to do component swapping, removal to isolate failures and overall deeper FA
- Visual mechanical inspection (VMI) of Server/GPU board components (Motherboards, GPU, GPU baseboard, CPU, DIMM, NIC, SSD, Power Supply, etc.) and/or electronics components
- Completes component level trouble shooting (capacitor, resistors, fuse, IC, diode, etc.) and failure analysis
Must to have:-
- Need atleast 8 Years to upto 25 Years of experience on Hardware design/Testing/System design/Testing- Mainly on server product
- Perform system and board level testing and debugging down to components level – Must to have
- Server knowledge is highly desirable. (BIOS / BMC / CPLD / FPGA, etc)- Design/Testing/debugging– Must to have
- Read and interpret schematics/block diagrams with detailed understanding of server and subassembly functionality – Must to have
- Visual mechanical inspection (VMI) of Server/GPU board components (Motherboards, GPU, GPU baseboard, CPU, DIMM, NIC, SSD, Power Supply, etc.) and/or electronics components – Must to have
- Completes component level trouble shooting (capacitor, resistors, fuse, IC, diode, etc.) and failure analysis – Must to have
- knowledge to do component swapping, removal to isolate failures and overall deeper FA – Must to have
- Knowledge of basic Linux environment and commands – Good to have