mm256_srli,slli_si256; mm256_bsrli,bslli_epi128 to const generics by minybot · Pull Request #1067 · rust-lang/stdarch

minybot · 2021-03-09T14:03:33Z

f16c: _mm256_cvtps_ph; mm_cvtps_ph

rust-highfive · 2021-03-09T14:03:37Z

(rust-highfive has picked a reviewer for you, use r? to override)

minybot · 2021-03-09T14:23:41Z

In f16c.rs, the signature is _mm256_cvtps_ph(a: __m256, imm_rounding: i32), and imm_rounding is currently restricted to 0-7.
I checked Clang, and it accepts 0-255.
Any suggestions?

Amanieu · 2021-03-09T15:47:55Z

The instruction definition here says that bits 3 to 7 are ignored by the CPU. I think to be safe we should only allow imm3; we can always relax it later if necessary.
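
To illustrate the imm3 restriction discussed above, here is a minimal sketch in plain Rust. The names `effective_rounding`, `AssertImm3`, and `checked_rounding` are hypothetical and are not part of stdarch; the first function only models the claim that bits 3 to 7 are ignored, and the second shows one way a const generic immediate could be rejected at compile time when it does not fit in 3 bits.

```rust
/// Models the claim from the comment above: bits 3 to 7 of the immediate
/// are ignored, so only the low three bits take effect.
/// (Hypothetical helper, not a stdarch function.)
fn effective_rounding(imm8: u8) -> u8 {
    imm8 & 0b111
}

/// Hypothetical compile-time guard for a 3-bit immediate.
struct AssertImm3<const IMM: i32>;

impl<const IMM: i32> AssertImm3<IMM> {
    // Evaluated at compile time for each concrete IMM that is used.
    const OK: () = assert!(IMM >= 0 && IMM < 8, "immediate must fit in 3 bits");
}

/// Returns the immediate unchanged, but only compiles for values 0..=7.
fn checked_rounding<const IMM_ROUNDING: i32>() -> i32 {
    // Referencing the constant forces the assertion to be evaluated when
    // this function is instantiated with a concrete immediate.
    let () = AssertImm3::<IMM_ROUNDING>::OK;
    IMM_ROUNDING
}
```

Under this sketch, `checked_rounding::<4>()` compiles, while `checked_rounding::<8>()` would fail to build. Starting with the narrow range keeps the option of relaxing it later without breaking callers.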

minybot · 2021-03-09T16:14:39Z

> The instruction definition here says that bits 3 to 7 are ignored by the CPU. I think to be safe we should only allow imm3; we can always relax it later if necessary.

OK. I will finish f16c.

lqd · 2021-03-09T16:20:34Z

I'm still wondering about the fact that we multiply the immediates by 8 ("* 8") when they are supposed to be in bytes and <= 16 for the shifts.
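
As a rough illustration of the units in question, the following scalar model treats each 128-bit lane as a `u128` and shifts it left by a byte count. The function names are hypothetical; the model assumes the documented behaviour that counts above 15 produce zero and that the two lanes are shifted independently. In this model the `* 8` is only the conversion from a byte count to a bit count.

```rust
/// Scalar model of a left byte shift on one 128-bit lane.
/// (Hypothetical helper for illustration, not a stdarch function.)
fn bslli_lane(lane: u128, imm8: u8) -> u128 {
    if imm8 > 15 {
        // Shifting by 16 or more bytes clears the whole lane.
        0
    } else {
        // The immediate counts bytes; the shift operator counts bits.
        lane << (u32::from(imm8) * 8)
    }
}

/// Applies the same byte shift to both lanes; no bytes cross lanes.
fn bslli_epi128_model(a: [u128; 2], imm8: u8) -> [u128; 2] {
    [bslli_lane(a[0], imm8), bslli_lane(a[1], imm8)]
}
```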

Amanieu · 2021-03-09T16:32:00Z

It might be better to switch the implementation to use a shuffle like clang does and like we already do for _mm_slli_si128.
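
The shuffle idea can be sketched on plain byte arrays. This is only a model under stated assumptions, not the stdarch or clang implementation: the function name is hypothetical, byte 0 is taken as the least significant byte of each lane, and each output byte is picked from a 32-byte concatenation of 16 zeros followed by the 16 bytes of the lane.

```rust
/// Scalar model of a per-lane left byte shift expressed as a shuffle.
/// (Hypothetical helper for illustration.)
fn slli_si256_shuffle_model(a: [u8; 32], imm8: u8) -> [u8; 32] {
    // Counts above 15 behave like 16: every byte comes from the zero half.
    let shift = if imm8 > 15 { 16 } else { usize::from(imm8) };
    let mut out = [0u8; 32];
    for lane in 0..2 {
        let base = lane * 16;
        // Concatenation: 16 zero bytes, then the 16 bytes of this lane.
        let mut concat = [0u8; 32];
        concat[16..].copy_from_slice(&a[base..base + 16]);
        for i in 0..16 {
            // Index 16 - shift + i slides a 16-byte window over the
            // concatenation, pulling in `shift` zeros at the low end.
            out[base + i] = concat[16 - shift + i];
        }
    }
    out
}
```

Because the indices depend only on the immediate, a const generic immediate lets them be computed at compile time, which is what makes the shuffle form attractive.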

minybot · 2021-03-09T16:52:35Z

> It might be better to switch the implementation to use a shuffle like clang does and like we already do for _mm_slli_si128.

OK. I will modify it to be similar to _mm_slli_si128.

minybot · 2021-03-09T18:57:07Z

> It might be better to switch the implementation to use a shuffle like clang does and like we already do for _mm_slli_si128.

It seems that _mm256_slli_si256 is the same as _mm256_bslli_epi128?

Amanieu · 2021-03-09T22:20:30Z

Yes, see #1012.

Amanieu · 2021-03-09T22:22:59Z

 use crate::{
    core_arch::{simd::*, x86::*},
-    hint::unreachable_unchecked,
+    //    hint::unreachable_unchecked,


Deleted commented code.

Amanieu · 2021-03-09T22:23:28Z

-    }
-    transmute(constify_imm8!(imm8 * 8, call))
+    let r = vpslldq(a, IMM8 * 8);
+    transmute(r)


You can just call _mm256_bslli_epi128 here.

Amanieu · 2021-03-09T22:23:40Z

-    }
-    transmute(constify_imm8!(imm8 * 8, call))
+    let r = vpsrldq(a, IMM8 * 8);
+    transmute(r)


You can just call _mm256_bsrli_epi128 here.
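
Both review suggestions amount to making the si256 names thin wrappers over the epi128 ones. A scalar sketch of that delegation, with the immediate as a const generic, might look like the following. All names ending in `_model` are hypothetical stand-ins that operate on two `u128` lanes; they are not the stdarch intrinsics.

```rust
/// Two independent 128-bit lanes standing in for a 256-bit vector.
type Lanes = [u128; 2];

/// Only the low 8 bits of the immediate are used as the byte count.
fn byte_count<const IMM8: i32>() -> u32 {
    (IMM8 & 0xff) as u32
}

/// Model of a per-lane left byte shift.
fn bslli_epi128_model<const IMM8: i32>(a: Lanes) -> Lanes {
    let bytes = byte_count::<IMM8>();
    a.map(|lane| if bytes > 15 { 0 } else { lane << (bytes * 8) })
}

/// Model of a per-lane right byte shift.
fn bsrli_epi128_model<const IMM8: i32>(a: Lanes) -> Lanes {
    let bytes = byte_count::<IMM8>();
    a.map(|lane| if bytes > 15 { 0 } else { lane >> (bytes * 8) })
}

/// The si256 name simply forwards to the epi128 model.
fn slli_si256_model<const IMM8: i32>(a: Lanes) -> Lanes {
    bslli_epi128_model::<IMM8>(a)
}

/// Same forwarding for the right shift.
fn srli_si256_model<const IMM8: i32>(a: Lanes) -> Lanes {
    bsrli_epi128_model::<IMM8>(a)
}
```

Forwarding keeps a single implementation per direction, so a fix to one name cannot leave the other behind.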

minybot · 2021-03-09T22:24:11Z

> Yes, see #1012.

Thanks. I think my bsrli_epi128 and bslli_epi128 have problems. I need to check them first.

mm256_srli,slli_si256; mm256_bsrli,bslli_epi128

77e893a

rust-highfive assigned Amanieu Mar 9, 2021

_mm256_cvtps_ph; mm_cvtps_ph

6d83ff5

lqd mentioned this pull request Mar 9, 2021

Convert the last avx512f and avx512vpclmulqdq intrinsics #1068

Merged

fix mm256_bslli_epi128, mm256_bsrli_epi128

7dc9e91

Amanieu reviewed Mar 9, 2021

fix mm256_srli_si256, mm256_slli_si256, mm512_bsrli_epi128

a1952a0

Amanieu merged commit 3559569 into rust-lang:master Mar 10, 2021

minybot deleted the avx2 branch March 10, 2021 14:29

marcgalois mentioned this pull request May 18, 2021

Regression on nightly in AVX2 byte shift intrinsics rust-lang/rust#85446

Closed
