OpenMP and MPI
As an HPC enthusiast eager to work in parallel computing architectures performance tuning and contribute towards exascale computing
Experienced in performance analysis at SoC interfaces and fine tuning HPC/ML workloads
Being interested to work in power-on eco systems have successfully collaborated multiple projects power-on while driving PnP activities
Working development of performance models and characterizing latency-bandwidth profiles for SoC interfaces on Intel Xeon servers.
Current exploration includes CXL memory performance profiling and identifying gaps with respect to core, cache coherency interconnects and DRAM-CXL memory balance.
C programming
Python programming
Linux kernel programming
GPGPU and SoC architecture level performance analysis and optimization
Memory architecture and HBM perf optimization at hardware and software level
PCIE architectural understanding and perf optimization
Xe-link/NV-link debug and optimization
SoC level power management and optimizations using Pmax, Psys, RAPL
OpenMP and MPI
OpenCL
The project deals with implementation of real time scheduling algorithm in kernel of non-real time OS for real time systems by reducing the latency time. The implementation gives a result of 50 to 70 ns latency improvement approx for modified kernel of non-real time OS.
https://www.ijitee.org/wp-content/uploads/papers/v8i11/K25320981119.pdf
Received DRA for Excellent planning and execution for Discrete Server GPU based workloads for performance tuning.