Update test and benchmark generation for blockwise kr>2 #6557

GregoryComer · 2024-06-12T09:16:14Z

This PR updates test generation for blockwise (qb4w) kernels in preparation for ISA-specific kernels with kr > 2. Blockwise kernels currently enforce several constraints:

Kc is divisible by block size (no partial blocks in a column).
Block size is divisible by 2*kr*sr (inner loop is 2x unrolled for 2 planes, no partial epilogue).

The majority of the existing gemm tests cover edge cases that are incompatible with the above constraints, so this PR conditionally disables the irrelevant tests for blockwise kernels. Blockwise kernels still have tests for strides, M subtiles, N subtiles, and varying block/k sizes.

Blockwise tests also benefit from linear stepping of nc and kc. Existing test logic always calls NextPrime after incrementing nc and kc values. To allow for linear stepping in certain blockwise tests while also maintaining the existing behavior, I've tentatively added a step_type parameter to LoopParams, allowing for either linear or prime steps. The default value for step_type is set in GemmTestParams to maintain the existing behavior of prime steps for nc and kc.

Additionally, benchmark logic for blockwise kernels now rounds kc up to a multiple of 2*kr*sr.

All tests were regenerated via scripts/generate-tests.sh.

test/gemm-microkernel-tester.h

GregoryComer · 2024-06-12T09:21:48Z

test/gemm-microkernel-tester.h

 struct LoopParams {
  LoopParams() = default;
-  explicit LoopParams(size_t from, size_t to, size_t step)
-      : is_set(true), from(from), to(to), step(step) {}
+  explicit LoopParams(size_t from, size_t to, size_t step, LoopStepType step_type)


If preferred, I could change this to a boolean step_prime variable or similar. I figured the enum class approach is slightly more readable, but does feel less idiomatic to the codebase. Feel free to weigh in on this if you prefer it differently.

GregoryComer · 2024-06-12T09:22:13Z

test/gemm-microkernel-tester.h

@@ -549,23 +583,3 @@ struct GemmTestParams {
 };

 using GemmTest = testing::TestWithParam<GemmTestParams>;
-


See above comment - methods are moved to the top of the header.

GregoryComer · 2024-06-12T09:23:41Z

tools/generate-gemm-test.py

@@ -236,93 +236,52 @@ def split_ukernel_name(name):
        , test_func, isa_check)
        .loop_n(1, nr)
        .loop_m(1, mr));
-  if (k_block > 1) {


The diff looks messy, but all that actually changed here is that block of tests that are incompatible with the blockwise kernel assumptions are wrapped in the $if KERNELTYPE not in ['qb4w']. I did re-generate all tests via scripts/generate-tests.sh and there are no functional changes to existing tests.

GregoryComer · 2024-06-13T12:53:32Z

@alankelly Hey Alan, could you take a look at this PR when you have time? Once this is merged, we can start posting the ISA-specific qb4w kernels. Thanks.

alankelly · 2024-06-26T21:52:52Z

@gonnet Can you please review?

GregoryComer commented Jun 12, 2024

View reviewed changes

test/gemm-microkernel-tester.h Show resolved Hide resolved

GregoryComer commented Jun 12, 2024

View reviewed changes

GregoryComer marked this pull request as ready for review June 12, 2024 09:25

GregoryComer mentioned this pull request Jun 17, 2024

QB4W MLAL GEMM Kernels #6574

Open

mcr229 mentioned this pull request Jun 17, 2024

QB4W SSE2/SSE41 GEMM Kernels #6576

Open

GregoryComer force-pushed the qb4w-test-multi-kc branch from af15e05 to 9f6c40d Compare June 18, 2024 07:58

digantdesai mentioned this pull request Jun 20, 2024

QB4W Development #6502

Open

11 tasks

GregoryComer force-pushed the qb4w-test-multi-kc branch from 9f6c40d to eac5331 Compare June 24, 2024 21:09

GregoryComer mentioned this pull request Jun 24, 2024

QB4W AVX2 GEMM Kernels #6618

Open

GregoryComer force-pushed the qb4w-test-multi-kc branch from eac5331 to 01d3ee3 Compare June 24, 2024 22:40

Update test gen and benchmark for blockwise kr>2

d974e3c

GregoryComer force-pushed the qb4w-test-multi-kc branch from 01d3ee3 to d974e3c Compare June 24, 2024 22:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update test and benchmark generation for blockwise kr>2 #6557

Update test and benchmark generation for blockwise kr>2 #6557

GregoryComer commented Jun 12, 2024 •

edited

Loading

GregoryComer Jun 12, 2024

GregoryComer Jun 12, 2024

GregoryComer Jun 12, 2024

GregoryComer commented Jun 13, 2024

alankelly commented Jun 26, 2024

		@@ -549,23 +583,3 @@ struct GemmTestParams {
		};

		using GemmTest = testing::TestWithParam<GemmTestParams>;

Update test and benchmark generation for blockwise kr>2 #6557

Are you sure you want to change the base?

Update test and benchmark generation for blockwise kr>2 #6557

Conversation

GregoryComer commented Jun 12, 2024 • edited Loading

GregoryComer Jun 12, 2024

Choose a reason for hiding this comment

GregoryComer Jun 12, 2024

Choose a reason for hiding this comment

GregoryComer Jun 12, 2024

Choose a reason for hiding this comment

GregoryComer commented Jun 13, 2024

alankelly commented Jun 26, 2024

GregoryComer commented Jun 12, 2024 •

edited

Loading