close
Skip to main content
Filter by
Sorted by
Tagged with
Advice
1 vote
2 replies
70 views

I am making a game that I am gonna include assembly for the best graphics, since I have a amd cpu, I use nasm so is that mean if I compile nasm code to .exe would it still work on other cpus?
Yasheen Bambaranda's user avatar
2 votes
2 answers
137 views

Intel instruction VCVTSD2USI (and some similar AVG512 instruction with GPRs) is encoded as EVEX.LLIG.F2.0F.W0 79 /r VCVTSD2USI r32, xmm1/m64{er}, for instance VCVTSD2USI EAX,XMM0 is assembled to 62 F1 ...
vitsoft's user avatar
  • 6,097
0 votes
1 answer
64 views

PyTorch supports Intel GPU through torch.xpu, but PyTorch Lightning does not currently have built-in XPU accelerator support. Because NeuralForecast uses Lightning under the hood, that also blocks ...
Marek Ozana's user avatar
Advice
0 votes
5 replies
111 views

I have done some tests and verified that on more-"recent" Intel and AMD processors, the cache line prefetcher behaves differently when a line belongs to a base page vs a huge page. How is ...
Mani's user avatar
  • 135
Advice
1 vote
3 replies
144 views

I've been looking at the zlib1.dll that comes with Win 11 Pro and I was hoping for some assistance with the following passage: 56b:    b8 71 80 07 80          mov    eax,0x80078071 [570:    41 0f 42 ...
Gyst's user avatar
  • 1
2 votes
0 answers
69 views

Assume this code in x86_64 assembly: section .data msg db "Hello, world!" section .text global _start _start: ;; system call 1 is sys_write mov rax, 1 ...
Bram Fran's user avatar
  • 133
0 votes
0 answers
78 views

I have this bootloader I made a while ago and I would like it to be in nasm: .intel_syntax noprefix .code16 .equ STACK_TOP, 0x7C00 .equ SELF_LOAD, 0x7C00 .equ ELF_HDR_LOAD, 0x7E00 .equ SECT_SIZE, ...
Connor Thomson's user avatar
6 votes
1 answer
214 views

My code is ... fragment1 // compares several regions in D1$ to D1$/D3$ __atomic_fetch_add(&lock,-1,__ATOMIC_ACQ_REL); // stmt A fragment2 // moves several regions from D1$/D3$ to D1$ ...
Henry Rich's user avatar
1 vote
1 answer
135 views

I wonder if Intel (and Intel compatible) CPUs have an instruction (for diagnostic/debugging purposes) which, for a given linear address, returns the result of paging translation (i.e. the ...
Ruslan's user avatar
  • 184
1 vote
0 answers
103 views

I am reading a paragraph about the tbb::parallel_scan algorithm from the book Intel Threading Building Blocks, and I understood what the operation does serially, but I am not understanding what are ...
luczzz's user avatar
  • 456
Best practices
1 vote
2 replies
139 views

My assembly program reads characters in a text file by loading them one by one in register 'al'. However I sometime need to use rax fully, and I think this causes a partial register stall. Now I think ...
Kun Xiang's user avatar
0 votes
1 answer
125 views

I am trying to implement Cache allocation Technology`s impact with my CPU. However, when I use either lscpu to see whether my CPU supports, or cpuid -l 0x10, output is false. How is this possible? How ...
Ali Hosseini's user avatar
7 votes
1 answer
248 views

I'm experimenting with the IMUL r64, r64 instruction on an Intel Xeon E5-1620 v3 (Haswell architecture, base clock 3.5 GHz, turbo boost up to 3.6 GHz, Hyper Threading is enabled). My test loop is ...
Andrey Dmitriev's user avatar
3 votes
1 answer
161 views

I have a JavaFX desktop application that started having rendering issues after updating the Intel Iris Xe graphics driver. On Java 11 + JavaFX (Zulu distribution): openjdk version "11.0.25" ...
Guilherme Almeida's user avatar
2 votes
0 answers
102 views

A while ago, I was training a deep learning model on a computer without an NVIDIA GPU but with an Intel GPU. I only used the CPU for training, which was painfully slow. It suddenly occurred to me: can ...
Famhap's user avatar
  • 21

15 30 50 per page
1
2 3 4 5
238