Системы, в которых обработка данных распределена между множеством компьютеров в сети. Выберите подраздел для уточнения специфических подходов и архитектур.
Статьи по коду 004.75
361 публикаций
Нажмите рядом со статьёй — скопируете ссылку для списка литературы по ГОСТ.
Extreme-Scale Atomistic Simulation of Real-Temperature Magnetic Skyrmion Dynamics by Coupled Spin-Lattice Modeling
Pin Chen, Cheng-bing Chen, Hai Liu, Yuewen Huang, Kangyou Zhong, Hai-Jun Zhao, Liu-Liu Han, Guixin Guo, Jiang Li, Dan Huang, Ben Xu, Yutong Lu
· 2026
On the Limits of Causal Observation in Shared-Memory Systems
Gilde Valeria Rodríguez, Armando Castañeda, Miguel Piña
· 2026
The Multipath Reliable Connection (MRC) Transport
Rip Sohan, Eric Spada, Eric Davis, Mark Handley, Idan Burstein, Tony Hurson, Jithin Jose, Vivek Kashyap, Rong Pan, Sayantan Sur, Sreevatsa Anantharamu, Aviv Barnea, Adrian Caulfield, Elazar Cohen, Elliot Edmunds, Yamin Friedman, Mahdieh Ghazi, Murali Guramali, Torsten Hoefler, Vipin Jain, Abdul Kabbani, Noam Katz, Yanfang Le, Charlie Mbariky, Guglielmo Morandin, Masoud Moshref, Shane O'Neil, Michael Papamichael, Jonas Pfefferle, Siva Santosh Pyla, Costin Raiciu, David Riddoch, Karen Schramm, Yuv
· 2026
CoAgent: Concurrency Control for Multi-Agent Systems
Hongtao Lyu, Dingyan Zhang, Mingyu Wu, Xingda Wei, Haibo Chen
· 2026
Is RISC-V Ready for Massively Parallel Astrophysical Codes?
Jenny Lynn Almerol, Nitin Shukla, Federico Ficarelli, Geray S. Karademir, Andrea Bartolini, Emanuele Venieri, Giacomo Madella, Elisabetta Boella
· 2026
Latency Prediction for LLM Inference on NPU Systems
Juhyun Park, Seungwoo Jeong, Jingyu Lee, Kyungyong Lee
· 2026
Quantifying the Impact of Lossy Compression on Neural Generative Surrogate Modeling
Zhimin Li, Harshitha Menon, Charles Jekel, Valerio Pascucci, Peter Lindstrom
· 2026
PreLort: Prefix-Nested LoRA for Federated Fine-Tuning under Rank Heterogeneity
Muhammad Waseem, Nurbek Tastan, Andrej Jovanovic, Nicholas D. Lane, Nils Lukas, Karthik Nandakumar, Samuel Horvath
· 2026
Di5Guise: 5G Privacy with vSIM
Shirin Ebadi, Zach Moolman, Eric Keller, Tamara Lehman
· 2026
A Modern Large-Scale Memory Characterization Laboratory
Ataberk Olgun, Haocong Luo, Ismail Emir Yuksel, F. Nisa Bostanci, A. Giray Yaglikci, Onur Mutlu
· 2026
Tangram: Hiding GPU Heterogeneity for Efficient LLM Parallelization
Yanda Tao, Pedro F. Silvestre, Marcel Wagenländer, Peter Pietzuch
· 2026
Diagonal-Budgeted Trotterization for Efficient Quantum Hamiltonian Simulation
Srikar Chundury, Blake Burgstahler, Jiajia Li, In-Saeng Suh, Frank Mueller
· 2026
Maestro: Workload-Aware Cross-Cluster Scheduling for LLM-Based Multi-Agent Systems
Jinghao Wang, Xiao Zhou, Xiaoyang Sun, Yihui Zhang, Yilong Li, Tianyu Wo, Xu Wang, Chunming Hu, Renyu Yang
· 2026
Simple-IT: Practical Low-Latency Signature-Free BFT Consensus
Qianyu Yu, Juan Villacis, Giuliano Losa, Zhuolun Xiang, Xuechao Wang
· 2026
When the UE Turns Adversary: Real-Time Uplink Jamming from Within 5G Networks
Rosolino Alaimo, Alessandra Dino, Ilenia Tinnirello, Domenico Garlisi
· 2026
ITME: Inference Tiered Memory Expansion with Disaggregated CXL-Hybrid Memories
Hakbeom Jang, Younghoon Min, Sunwoong Kim, Taeyoung Ahn, Hanyee Kim, Youngpyo Joo, Hoshik Kim, Jongryool Kim
· 2026
NetCause: Counterfactual Learning for Root Cause Analysis in Large-Scale Networks
Fabien Chraim, Jian Zhang, Dominik Janzing, Xiang Song, Christos Faloutsos, John Evans
· 2026
Efficient and Robust Online Learning to Rank in Decentralized Systems
Marcel Gregoriadis, Martijn de Vos, Sayan Biswas, Anne-Marie Kermarrec, Johan Pouwelse
· 2026
Harnessing Routing Foresight for Micro-step-level MoE load balancing in RL Post-training
Yuming Zhou, Haoyang Li, Sheng Lin, Yanfeng Zhao, Tong Zhao, Xupeng Miao, Jie Jiang, Fangcheng Fu, Bin Cui
· 2026
The PM-EdgeMap: Towards Real-Time Process Mining on the Edge-Cloud Continuum
Hendrik Reiter, Christian Imenkamp, Olaf Landsiedel, Andrea Maldonado, Patrick Rathje, Wilhelm Hasselbring
· 2026
Chimera: Protocol-Aware Recovery for Confidential BFT Consensus
Tong Liu, Xiaoqing Wen, Ziwei Zhou, Si Liu, Jianyu Niu, Cong Wang, Yinqian Zhang
· 2026
AutoPilot: Learning to Steer High Speed Robust BFT
Liangrong Chen, Yue Zhang, Eric Zhou, Mohammad Javad Amiri, Ryan Marcus, Chenyuan Wu
· 2026
PCCL: Process Group-Aware Scalable and Generic Collective Algorithm Synthesizer
William Won, Kartik Lakhotia, Madhu Kumar, Sudarshan Srinivasan, Tushar Krishna
· 2026
A Neurosymbolic Prolog Skill for LLM-Driven Service Placement
Jacopo Massa, Giuseppe Bisicchia, Patrizio Dazzi, Antonio Brogi
· 2026
Piper: A Programmable Distributed Training System
Megan Frisella, Shubham Tiwari, Andy Ruan, Yi Pan, Parker Gustafson, Mat Jacob, Gilbert Bernstein, Stephanie Wang
· 2026
Demystifying NVSHMEM: A System-Level Analysis on Symmetric Memory and Device-Initiated Operations in GPU Communication
Yijun Ma, Siyuan Shen, Tiancheng Chen, Akhil Langer, Jiri Kraus, Benjamin Glick, Craig Belusar, Jeff Hammond, Torsten Hoefler
· 2026
UltraEP: Unleash MoE Training and Inference on Rack-Scale Nodes with Near-Optimal Load Balancing
Xinming Wei, Chao Jin, Tuo Dai, Yinmin Zhong, Shan Yu, Chengxu Yang, Bingyang Wu, Zili Zhang, Jing Mai, Qianchao Zhu, Zhouyang Li, Yuliang Liu, Guojie Luo
· 2026
Kairos: Lightweight Testing Framework for Timing-Induced Interaction Failures in LTE and 5G Core Networks
Wei Guo, Yuanhao Li, Hao Zheng, Junman Qin, Jun Kong, Jiapeng Li, Qiang Fu, Jiadai Wang, Jiajia Liu
· 2026
HetCCL: Enabling Collective Communication For Mixed-Vendor Heterogeneous Clusters
Yuejie Wang, Tao Chang, Yuanyuan Zhao, Yulong Ao, Zeyu Gu, Zhiyu Li, Yanmin Jia, Yan Zhang, Mingjun Zhang, He Liu, Yongzhe He, Yonghua Lin, Guyue Liu
· 2026
Contrastive Learning and Correlation Clustering for Sequences of Network Telescope Data
Jannik Presberger, Alexander Männel, Maynard Koch, Thomas C. Schmidt, Matthias Wählisch, Bjoern Andres
· 2026
Thou Shall Not Pass: Gatekeeping Outbound TLS Connections
Henrique B. Brum, Matteo Franzil, Riccardo Germenia, Salvatore Manfredi, Domenico Siracusa, Luis A. Dias Knob
· 2026
The local complexity of certifying parity
Nicolas Bousquet, Laurent Feuilloley, Jorge Valenzuela, Sébastien Zeitoun
· 2026
Offloading L7 Policies to the Kernel
Laurin Brandner, Ayush Mishra, Sebastiano Miano, Aurojit Panda, Gianni Antichi, Laurent Vanbever
· 2026
SIGMA: A Versatile Streaming Graph Partitioner for Vertex- and Edge-Balanced Distributed GNN Training
Barbara Hoffmann, Shai Dorian Peretz, Adil Chhabra, Ahmet Kadir Yalcinkaya, Ruben Mayer, Christian Schulz
· 2026
FlexNPU: Transparent NPU Virtualization for Dynamic LLM Prefill-Decode Co-location
Jiongjiong Gu, Jianfeng Wang, Zidong Han, Yongqiao Wang, Pengfei Xia, Mingjie Zhang, Hong Liu, Yuanyi Xia, Jiajia Chu, Yifeng Tang, Hui Zang, Xin Yao, Qijie Qiu, Yuzhao Wang, Chuanfei Xu, Lin Zhang, Zhuonan Lai, Hongming Huang, Jiawei Qiu, Gong Zhang, Zhong Ming, Weipeng Cao
· 2026
Strategies for Molecular Dynamics using Hybrid Systems: LAMMPS Use Case
Paulo Henrique Leme Ramalho, Dennis Alves Pedersen, Fábio Andrijauskas
· 2026
Discrete Incremental Voting: New Bounds for General Graphs and Expanders
Petra Berenbrink, Colin Cooper, Thorsten Götte, Lukas Hintze, Tomasz Radzik
· 2026
On GPU Implementation for Multi-Precision Integer Division
Martin B. Marchioro, Aske N. Raahauge, Marc I. Løvenskjold, Cosmin E. Oancea, Stephen M. Watt
· 2026
Keynote Speech
Willy Susilo
· 2022
RadioMaster: Multi-Agent System for Autonomous Radio Signal Generation
Jiazhen Lei, Tianze Cao, Yuxin Sha, Sihan Wang, Bingbing Wang, Fengyuan Zhu, Zeming Yang, Xiaohua Tian
· 2026
A Unified E2E Energy Efficiency Testing Framework for Open RAN
Marcin Hoffmann, Marcin Dryjański, Adrian Kliks, Andreas Gladisch, Ajesh Pulyaar Keerthi, Mohammadreza Razmi, Heiko Lehmann
· 2026
Waiting at the front door: Continuous monitoring of latency in the host network stack
Simon Sundberg, Anna Brunstrom, Simone Ferlin-Reiter, Jesper Dangaard Brouer, Toke Høiland-Jørgensen
· 2026
Fast TetraBFT: Optimizing Latency Where It Matters
Antonio J. Fernández-Pinto, Manuel Bravo, Gregory Chockler, Alexey Gotsman
· 2026
E2LLM: Towards Efficient LLM Serving in Heterogeneous Edge/Fog Environments
Truong-Thanh Le, Amir Taherkordi, Hoang-Loc La, Frank Eliassen, Phuong Hoai Ha, Peiyuan Guan
· 2026
GNN-based Online Beamforming Design for HAPS-Assisted NTN
Lavanya S S Anjapuli, Animesh Yadav, Halim Yanikomeroglu
· 2026
KISS: Keeping it Simple and Slotted when Learning to Communicate over Wireless
Kamil Szczech, Maksymilian Wojnar, Krzysztof Rusek, Katarzyna Kosek-Szott, Szymon Szott
· 2026
GPU Acceleration of Learning With Errors KEMs Using OpenACC for Post-Quantum Cryptography
Tiziana Liberati, Nitin Shukla, Matteo Barbieri, Gabriella Bettonte, Elisabetta Boella, Simone Rizzo, Daniele Gregori, Marco Pedicini
· 2026
GuidaPA: Privacy-Preserving Chatbot for Public Administration via Federated Learning
Daniel M. Jimenez-Gutierrez, Albenzio Cirillo, Raffaele Nicolussi, Alessio Beltrame, Andrea Vitaletti
· 2026
SQEEZ: Energy-efficient Location Sharing for Mobile Ad Hoc Networks
Ram Ramanathan, Dmitrii Dugaev, Ryan Conyac, Alon Mor, Charlie Greenbacker
· 2026
CA-AC-MPC: CUDA-Accelerated Actor-Critic Model Predictive Control
Antoonio Buo, Vittorio Cammarota, Michele Avagnale, Pierluigi Arpenti, Vincenzo Lippiello, Fabio Ruggiero
· 2026
Throughput-Optimized Networks at Scale
Conor James Green, Mithuna Thottethodi
· 2026
Resource Allocation in HyperX Networks
Alejandro Cano, Cristóbal Camarero, Carmen Martínez, Ramón Beivide
· 2026
EnCoR: An end-to-end architecture for simplifying cellular networks
Wesley Woo, Zhuowei Wen, Monniiesh Velmurugan, Richard Raad, Sylvia Ratnasamy, Scott Shenker, Shaddi Hasan
· 2026
Characterization-Guided GPU Fault Resilience in NVIDIA MPS
Rixin Liu, Xingqi Cui, Kaijian Wang, Xinheng Ding, Zirui Liu, Yuke Wang, Jiarong Xing
· 2026
Ciphera: A Decentralised Biometric Identity Framework
Ankit Kanaiyalal Prajapati, Shahzad Memon, Mohammed Mahir Rahman, Ameer Al-Nemrat
· 2026
RAFI -- A Ray/Work Forwarding Infrastructure for Data Parallel Multi-Node/Multi-GPU Computing
Ingo Wald, Serkan Demirci, Alper Sahistan, Stefan Zellmann, Andrea Paris, Patrick Moran, Milan Jaros, Tatiana von Landesberger, Ugur Gudukbay, Valerio Pascucci
· 2026
Optimus: Elastic Decoding for Efficient Diffusion LLM Serving
Chiyue Wei, Cong Guo, Bowen Duan, Junyao Zhang, Haoxuan Shan, Yifei Wang, Yangjie Zhou, Hai "Helen" Li, Danyang Zhuo, Yiran Chen
· 2026
DECICE: AI-Driven Scheduling and Digital Twin Integration for the Cloud-HPC-Edge Compute Continuum
Aasish Kumar Sharma, Felix Stein, Mirac Aydin, Michael Bidollahkhani, Sachin P. Nanavati, Mohsen Seyedkazemi Ardebili, Giorgi Mamulashvili, Mojtaba Akbari, Jonathan Decker, Zoya Masih, Julian M. Kunkel
· 2026
SLA-Aware Traffic Steering in Hybrid TN-NTN 5G Backhaul: A Potential Game Approach
Hojjat Navidan, Delia Rico, Mohammad Cheraghinia, Ingrid Moerman, Adnan Shahid
· 2026
AMP: Arc Multi-Proposer Protocol with Bounded Inclusion Guarantees
Daniel Cason, Gordon Liao, Sergio Mena, Nenad Milošević, Adi Seredinschi, Alessandro Sforzin, João Sousa, Preston Vander Vos
· 2026
Flare: Leveraging Serverless Elasticity to Absorb Microservice Load Spikes
Dilina Dehigama, Shyam Jesalpura, David Schall, Antonios Katsarakis, Marios Kogias, Rakesh Kumar, Boris Grot
· 2026
Ant Backpressure Routing for Dynamic Wireless Multi-hop Networks with Mixed Traffic Patterns
Negar Erfaniantaghvayi, Zhongyuan Zhao, Kevin Chan, Ananthram Swami, Santiago Segarra
· 2026
Impact of Atmospheric Turbulence and Pointing Error on Earth Observation
Celia Sánchez-de-Miguel, Antonio M. Mercado-Martínez, Beatriz Soret, Antonio Jurado-Navas, Miguel Castillo-Vázquez
· 2026
DRL-Driven Edge-Aware Utility Optimization for Multi-Slice 6G Networks
Khaled M. Naguib, Soumaya Cherkaoui, Mahmoud M. Elmessalawy, Ahmed M. Abd El-Haleem, Ibrahim I. Ibrahim
· 2026
Bandwidth-Aware LLM Inference on Heterogeneous Many-Core Supercomputers
Yao Lu, Zhongzhi Luan, Gen Li, Jiaxing Qi, Shiqing Ma, Bin Han, Shizhe Shang, Hailong Yang, Depei Qian
· 2026
Extreme-Scale Interconnection Networks
Alejandro Cano, Cristina Brinza, Cristóbal Camarero, Carmen Martínez, Ramón Beivide
· 2026
Nonlinear spectral clustering with C++ GraphBLAS
Dimosthenis Pasadakis, Olaf Schenk, Verner Vlacic, Albert-Jan Yzelman
· 2026
Autonomic Federated-Market Orchestration for the Edge-Cloud Continuum
Lauri Lovén, Roberto Morabito, Abhishek Kumar, Susanna Pirttikangas, Jukka Riekki, Sasu Tarkoma
· 2026
SafeSABR: Risk-Calibrated Adaptive Bitrate Streaming over Starlink Networks
Hongjun Xie, Jiahang Zhu, Zhiming Shao, Chao Fan, Zenghui Zhang, Genke Yang, Pengcheng Luo
· 2026
BShare: Packet Queueing Delay-Driven Buffer Sharing for Datacenter Switches
Krishna Agarwal, Muhamad Rizka Maulana, Vamsi Addanki, Habib Mostafaei
· 2026
Adaptive KV Cache Reuse for Fast Long-Context LLM Serving
Fei li, Song Liu, Yan Liu, Jinhua Cui, Shiqiang Nie, Jinyu Wang, Weiguo Wu
· 2026
The Time is Here for Just-in-Time Systems: Challenges and Opportunities
Shu Liu, Alexander Krentsel, Shubham Agarwal, Mert Cemri, Ziming Mao, Soujanya Ponnapalli, Alexandros G. Dimakis, Sylvia Ratnasamy, Matei Zaharia, Aditya Parameswaran, Ion Stoica
· 2026
Polar: Agentic RL on Any Harness at Scale
Binfeng Xu, Hao Zhang, Shaokun Zhang, Songyang Han, Mingjie Liu, Jian Hu, Shizhe Diao, Zhenghui Jin, Yunheng Zou, Michael Demoret, Jan Kautz, Yi Dong
· 2026
LiveR: Fine-Grained Elasticity via Live Reconfiguration for Model Training
Haoyuan Liu, Kairui Zhou, Shuyao Qi, Qinwei Yang, Shengkai Lin, Shizhen Zhao, Wei Zhang
· 2026
Nf-PEAK: Process-Based Energy Attribution for Nextflow Workflows on Kubernetes Clusters
Philipp Thamm, Somayeh Mohammadi, Kathleen West, Knut Reinert, Lauritz Thamsen, Ulf Leser
· 2026
HyperParallel-MoE: Multi-Core Interleaved Scheduling for Fast MoE Training on Ascend NPUs
Zewen Jin, Congkun Ai, Guangpeng Zhang, Hanbo Zhang, Haoran Wang, Shihan Xiao, Da Lei, Xuefeng Jin, Teng Su, Cheng Li
· 2026
Hybrid Edge-HPC Systems for Low-Latency Data-Driven Inference
Liubov Kurafeeva, Ryan Hartung, Benjamin Carter, Alan Subedi, Avhishek Biswas, Michael Fay, Shantenu Jha, Chandra Krintz, Andre Merzky, Douglas Thain, Memet Can Vuran, Rich Wolski
· 2026
Llamas on the Web: Memory-Efficient, Performance-Portable, and Multi-Precision LLM Inference with WebGPU
Reese Levine, Rithik Sharma, Nikhil Jain, Abhijit Ramesh, Zheyuan Chen, Neha Abbas, James Contini, Tyler Sorensen
· 2026
Instant GPU Efficiency Visibility at Fleet Scale
Connor Pedersen, Dong H. Ahn, Michel Migdal, Collin Neale, Nik Konyuchenko
· 2026
Mobility of Data in Distributed Hybrid Computing Systems
Philippe Faes, Mark Christiaens, Dirk Stroobandt
· 2007
Efficient Parallel CTL Model-Checking for Pushdown Systems
Xinyu Chen, Hansheng Wei, Xin Ye, Li Hao, Yanhong Huang, Jianqi Shi
· 2018
From Automated to Autonomous: Hierarchical Agent-native Network Architecture (HANA)
Binghan Wu, Shoufeng Wang, Yunxin Liu, Ya-Qin Zhang, Joseph Sifakis, Ye Ouyang
· 2026
iHAC: A Hybrid Cluster Architecture for Enhanced Performance and Resilience
Siddique Abubakr Muntaka, Edward Danso Ansong, Benjamin Yankson, Oliver Kornyo, Faiza Hussein, Mohammed Nadhir Muntaka, Joshua Dagadu, Prince Clement Addo, Maxwell Dorgbefu Jnr., Franco Osei-Wusu, Foster Yeboah, Michael Asante
· 2026
Ark: Offchain Transaction Batching in Bitcoin
Pim Keer, Matteo Maffei, Marco Argentieri, Andrew Camilleri, Zeta Avarikioti
· 2026
NanoCP: Request-Level Dynamic Context Parallelism for Data-Expert Parallel Decoding
Jiefei Chen, Binbin Lin, Jinming Ma, Jiangfei Duan, Haojie Duanmu, Hao Liu, Qinxiu Cheng, Xiuhong Li, Zhilin Pei, Hui Wang, Xingcheng Zhang, Dahua Lin
· 2026
LatentBox: Storing AI-Generated Images at Scale via a Latent-First Design
Zirui Wang, Yunjia Zheng, Tingfeng Lan, Zhaoyuan Su, Haoran Ni, Juncheng Yang, Yue Cheng
· 2026
Resilient Byzantine Agreement with Predictions
Julien Dallot, Darya Melnyk, Tijana Milentijevic, Stefan Schmid, Patrik Welters
· 2026
Special Issue on Cloud Computing
Gregory Chockler, Eliezer Dekel, Joseph JaJa, Jimmy Lin
· 2011
Distributed Stochastic Graph Algorithms
Keren Censor-Hillel, Aditi Dudeja, George Giakkoupis
· 2026
Frontier: Towards Comprehensive and Accurate LLM Inference Simulation
Yicheng Feng, Xin Tan, Yangtao Deng, Yimin Jiang, Yibo Zhu, Hong Xu
· 2026
PALS: Power-Aware LLM Serving for Mixture-of-Experts Models
Can Hankendi, Rana Shahout, Minlan Yu, Ayse K. Coskun
· 2026
NeuroRisk: Physics-Informed Neural Optimization for Risk-Aware Traffic Engineering
Yingming Mao, Ximeng Liu, Jingyi Cheng, Xiyuan Liu, Jiashuai Liu, Yike Liu, Zhen Yao, Yuzhou Zhou, Siyuan Feng, Qiaozhu Zhai, Shizhen Zhao
· 2026
Deep Tech to Space: Space Data Centers and AI Revolution at the Edge
Jonas Weiss, Patricia Sagmeister, Gabriel Maiolini Capez, Dinesh Verma, Roberto Garello, Alberto Perotti, Dawid Lazaj, Alicja Musial, Jakub Nalepa, Thomas Morf, Martin Schmatz, Marek Krawczyk, Mateusz Przeliorz, Kevin Roche, Sagar Tayal, Mahalakshmi Lakshminarayanan, Nicolas Longépé, Pierre-Philippe Mathieu, Agata Wijata
· 2026
Taking Cryptography Out of the Data Path via Near-Memory Processing in DRAM
Nicola Barcarolo, Brahmaiah Gandham, Mohammad Sadrosadati, Roberto Passerone, Onur Mutlu, Flavio Vella
· 2026
The Internet Runs on Names
Geoff Huston, Lixia Zhang
· 2026
EPIC: Abstraction and Polymorphism of In-Network Collectives on Ethernet
Yitao Yuan, Jianglong Nie, Tianyu Bai, Ruizhe Zhou, Siyuan Cao, Xujie Fan, Yuchen Xu, Junkai Chen, Chenqi Zhao, Nengyuan Zhang, Shaoke Fang, Jiangyuan Chen, Yuanfeng Chen, Jiaqi Sun, Zhan Wang, Xiaohua Xu, Yuchao Zhang, Yang Liu, Xiangrui Yang, Jing Lin, Xiaohe Hu, Yang Li, Chao Jiang, Limin Xiao, Weifeng Zhang, Junjie Wang, Wei Cheng, Yazhu Lan, Jianbo Dong, Binzhang Fu, Wenfei Wu
· 2026
PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Applications
Stephen Mell, David Mell, Konstantinos Kallas, Steve Zdancewic, Osbert Bastani
· 2026
Ranking Opinions with Few States in Population Protocols
Tom-Lukas Breitkopf, Julien Dallot, Antoine El-Hayek, Stefan Schmid
· 2026
Mosaic: Towards Efficient Training of Multimodal Models with Spatial Resource Multiplexing
Yanbo Wang, Yuxuan Wang, Chen Chen, Chunyu Xue, Yu Feng, Anbang Wu, Quan Chen, Yin Chen, Qizhen Weng
· 2026
РАЗРАБОТКА ИНТЕЛЛЕКТУАЛЬНОГО ЧАТ-БОТА ДЛЯ АВТОМАТИЗАЦИИ ОТВЕТОВ И АНАЛИЗА ПОТРЕБНОСТЕЙ ПОЛЬЗОВАТЕЛЕЙ: ИНФОРМАТИКА И ВЫЧИСЛИТЕЛЬНАЯ ТЕХНИКА
Мехдиев Эльнур Таджаддинович, Борисов Артём Андреевич, Ростоцский Максим Викторович, Шорохов Константин Дмитриевич, Кучук Максим Игоревич, и др.
· 2024
OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization
Zhongzhu Zhou, Donglin Zhuang, Jisen Li, Ziyan Chen, Shuaiwen Leon Song, Ben Athiwaratkun, Xiaoxia Wu
· 2026
TierCheck: Tiered Checkpointing for Fault Tolerance in Large Language Model Training
Shujie Han, Feng Jiang, Patrick P. C. Lee, Xiao Zhang, Zhijie Huang, Nannan Zhao, Xiaonan Zhao, Lichen Pan
· 2026
Guard: Scalable Straggler Detection and Node Health Management for Large-Scale Training
Guanliang Liu, Abhinandan Patni, Congzhu Lin, Zoe Zeng, Jack Wittmayer, Josh Wu, Ashvin Nihalani, Binxuan Huang, Yinghong Liu, Rory Na, Anthony Ko, Alexander Zhipa, Cong Cheng, Mi Sun, Vijay Rajakumar, Rejith George Joseph, Parthasarathy Govindarajen
· 2026
AdaptiveLoad: Towards Efficient Video Diffusion Transformer Training
Yucheng Guo, Yongjian Guo, Zhong Guan, Haoran Sun, Wen Huang, Wanting Xu, Jing Long, Shuai Di, Junwu Xiong
· 2026
More Than Meets the Eye: A Semantics-Aware Traffic Augmentation Framework for Generalizable Website Fingerprinting
Youquan Xian, Xueying Zeng, Lingjia Meng, Lei Cui, Runhan Song, Wei Wang, Zhengquan Ding, Peng Liu, Zhiyu Hao
· 2026
Avoiding Cross-Datacenter Collective Congestion via Disaggregated Buffering
Mariano Scazzariello, Noga H. Rotman, Dima Gavrilenko, Sajy Khashab, Alexander Shpiner, Matty Kadosh, Marco Chiesa, Dejan Kostic, Mark Silberstein
· 2026
A Few GPUs, A Whole Lotta Scale: Faithful LLM Training Emulation with PrismLLM
Shaoke Xi, ChonLam Lao, Boyi Jia, Jiaqi Gao, Zhipeng Zhang, Jiamin Cao, Brian Sutioso, Erci Xu, Minlan Yu, Kui Ren, Yong Li, Zhengping Qian, Ennan Zhai, Jingren Zhou
· 2026
Early-Stabilizing Counting
Christoph Lenzen, Julian Loss
· 2026
Ringmaster LMO: Asynchronous Linear Minimization Oracle Momentum Method
Abdurakhmon Sadiev, Artavazd Maranjyan, Ivan Ilin, Peter Richtárik
· 2026
Fast Gossip-based Rumor Spreading using Small Messages
Fabien Dufoulon, William K. Moses, Gopal Pandurangan
· 2026
Online optimization for scheduling preemptable tasks on IaaS cloud systems
Jiayin Li, Meikang Qiu, Zhong Ming, Gang Quan, Xiao Qin, Zonghua Gu
· 2012
MLCommons Chakra: Advancing Performance Benchmarking and Co-design using Standardized Execution Traces
Srinivas Sridharan, Andy Balogh, Bradford M. Beckmann, Brian Coutinho, Louis Feng, Sheng Fu, Sanshan Gao, Mehryar Garakani, Taekyung Heo, David Kanter, Josh Ladd, Ziwei Li, Winston Liu, Changhai Man, Dan Mihailescu, Spandan More, Joongun Park, Ashwin Ramachandran, Vinay Ramakrishnaiah, Saeed Rashidi, Vijay Janapa Reddi, Puneet Sharma, Phio Tian, William Won, Hanjiang Wu, Huan Xu, Jinsun Yoo, Tushar Krishna
· 2026
Kairos: A Scalable Serving System for Physical AI
Yinwei Dai, Ganesh Ananthanarayanan, Landon Cox, Xenofon Foukas, Bozidar Radunovic, Ravi Netravali
· 2026
PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding
Yunhe Han, Yunqi Gao, Bing Hu, Mahdi Boloursaz Mashhadi, Yitong Duan, Pei Xiao, Yanfeng Zhang
· 2026
Swarm Network-as-a-Service (SNaaS)
Balsam Alkouz, Osama Amin, Basem Shihada
· 2026
+ Добавить статью