WIP: Add Elementwise Functions support #1500

SpiritSeeker · 2025-12-17T02:33:07Z

This PR adds support for elementwise functions (elementwise unary operations), allowing for easy extension to all hls math functions from HLS Math Library. The current PR adds support for ReLU, elementwise Exp, and elementwise Erf functions.

Status of tests:
✔️ ReLU - FLOAT32, FLOAT16, INT, FIXED (cppsim and rtlsim)
✔️ Exp and Erf - cppsim + FLOAT32
✔️ Exp - rtlsim: FLOAT32 and FLOAT16

✖️ Exp and Erf - cppsim + FLOAT16 (numpy and simulated results differ significantly)
✖️ Erf - rtlsim: FLOAT32 and FLOAT16 (RTL watchdog timeout error)

…ling)

…nctions

…functions

preusser

Thanks, @SpiritSeeker!
Please, review comments.

preusser · 2026-01-06T15:44:35Z

src/finn/custom_op/fpgadataflow/elementwise_functions.py

+    @property
+    def cpp_op(self):
+        odt_hls_name = self.out_dtype.get_hls_datatype_str()
+        return "({0} > 0 ? (%s){0} : (%s)0)" % (odt_hls_name, odt_hls_name)


The reversed comparison {0} < 0 is easier for most datatypes.

preusser · 2026-01-06T15:49:36Z

src/finn/custom_op/fpgadataflow/elementwise_functions.py

+            inp_bw = self.inp_dtype.bitwidth()
+            # The output would be unsigned with same bit-width as input
+            # if input was unsigned, else one bit less
+            out_bw = inp_bw - 1 if self.inp_dtype.signed() else inp_bw


Consider issuing a warning when constructing a ElementwiseReLU node with an unsigned input type.

You can only safely strip a bit from the output datatype if the input datatype is narrow, i.e. within [-2^(n-1) + 1 : 2^(n-1) - 1].

preusser · 2026-01-06T15:56:11Z

src/finn/custom_op/fpgadataflow/elementwise_functions.py

+        odt_hls_name = self.out_dtype.get_hls_datatype_str()
+        # Explicitly use the overloads, using hls::exp results in minor errors
+        if self.out_dtype.get_canonical_name() == "FLOAT32":
+            return "(hls::expf((%s){0}))" % (odt_hls_name)


return "hls::exp(%s({0}))" % (odt_hls_name) should be the only return statement. Rely on function overload selection by the argument type for specialization.

preusser · 2026-01-06T15:57:41Z

src/finn/custom_op/fpgadataflow/elementwise_functions.py

+        odt_hls_name = self.out_dtype.get_hls_datatype_str()
+        # Explicitly use the overloads, using hls::erf results in minor errors
+        if self.out_dtype.get_canonical_name() == "FLOAT32":
+            return "(hls::erff((%s){0}))" % (odt_hls_name)


return "hls::erf(%s({0}))" % (odt_hls_name) should be the only return statement. Rely on function overload selection by the argument type for specialization.

preusser · 2026-01-06T16:06:45Z

src/finn/custom_op/fpgadataflow/hls/elementwise_functions_hls.py

+    # Generates C++ code for declaring all streams involved in C++ simulation
+    # for testing
+    def strm_decl(self):
+        # Allways add the output stream to the declarations


Why not concise?:

self.code_gen_dict["$STREAMDECLARATIONS$"] = [ # Note: Assumes stream type aliases to be set in defines "OutStream out0_V;", "InpStream in0_V;" ]

preusser · 2026-01-06T16:11:42Z

src/finn/custom_op/fpgadataflow/hls/elementwise_functions_hls.py

+            #pragma HLS BIND_STORAGE variable=out type=RAM_S2P impl=LUTRAM
+            """,
+            # Perfect loop nest over all folded output dimensions
+            *[for_loop(dim, size) + " {" for dim, size in enumerate(out_shape)],


This seems to be equivalent to a single flat loop using the product over all dimensions as its bound.

preusser · 2026-01-06T16:15:44Z

src/finn/custom_op/fpgadataflow/hls/elementwise_functions_hls.py

+
+        # Add HLS interface directives specifying how to create RTL ports for
+        # the top-level function arguments
+        self.code_gen_dict["$PRAGMAS$"] += [


Fuse into a single compact append for both lines of code.

preusser · 2026-01-06T16:17:48Z

src/finn/custom_op/fpgadataflow/hls/elementwise_functions_hls.py

+    def get_verilog_top_module_intf_names(self):
+        # Start collecting interface names in a dictionary starting with clock
+        # and reset
+        intf_names = {"clk": ["ap_clk"], "rst": ["ap_rst_n"]}


Pick up all the other associations as part of the initialization.

SpiritSeeker and others added 5 commits December 8, 2025 21:04

[EltwiseFunc] Add support for elementwise functions (cppsim tests fai…

ffcf498

…ling)

[EltwiseFunc] Add elementwise math functions and run pre-commit

8145d4e

Merge remote-tracking branch 'origin/dev' into feature/elementwise-fu…

659f4d0

…nctions

[EltwiseFunc] increase tolerances for float16

aecd1a7

Merge remote-tracking branch 'upstream/dev' into feature/elementwise-…

7f11f1d

…functions

preusser reviewed Jan 6, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

WIP: Add Elementwise Functions support #1500

WIP: Add Elementwise Functions support #1500

SpiritSeeker commented Dec 17, 2025

Uh oh!

preusser left a comment

Uh oh!

preusser Jan 6, 2026

Uh oh!

preusser Jan 6, 2026

Uh oh!

preusser Jan 6, 2026

Uh oh!

preusser Jan 6, 2026

Uh oh!

preusser Jan 6, 2026

Uh oh!

preusser Jan 6, 2026

Uh oh!

preusser Jan 6, 2026

Uh oh!

preusser Jan 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

WIP: Add Elementwise Functions support #1500

Are you sure you want to change the base?

WIP: Add Elementwise Functions support #1500

Conversation

SpiritSeeker commented Dec 17, 2025

Uh oh!

preusser left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants