Is there some meaningful statistical data to justify keeping signed integer arithmetic overflow undefined?


The C Standard explicitly specifies signed integer overflow as having undefined behavior. Yet most CPUs implement signed arithmetic with defined semantics for overflow (except maybe for division overflow: x / 0 and INT_MIN / -1).



Compiler writers have been taking advantage of the undefinedness of such overflows to add more aggressive optimisations that tend to break legacy code in very subtle ways. For example, this code may have worked on older compilers but does not anymore on current versions of gcc and clang:



/* Increment a by a value in 0..255, clamp a to positive integers.
   The code relies on 32-bit wrap-around, but the C Standard makes
   signed integer overflow undefined behavior, so sum_max can now
   return values less than a. There are Standard compliant ways to
   implement this, but legacy code is what it is... */
int sum_max(int a, unsigned char b) {
    int res = a + b;
    return (res >= a) ? res : INT_MAX;
}



Is there hard evidence that these optimisations are worthwhile? Are there comparative studies documenting the actual improvements on real life examples or even on classical benchmarks?



I came up with this question as I was watching this: C++Now 2018: John Regehr “Closing Keynote: Undefined Behavior and Compiler Optimizations”



I am tagging c and c++ as the problem is similar in both languages but the answers might be different.










Tags: c++ c language-lawyer signed integer-overflow






asked 7 hours ago by chqrlie, edited 7 hours ago












  • Comments are not for extended discussion; this conversation has been moved to chat.

    – Samuel Liew
    3 hours ago

  • The reason C says signed integer overflow is undefined is that some CPUs use "2's complement", some use "1's complement", some use "sign and magnitude"; and for all the cases overflow could cause anything (e.g. CPUs like MIPS have "trap on overflow"). In other words it's about portability and not optimisation.

    – Brendan
    2 hours ago

  • Exactly. The only 'meaningful statistic' anybody needs is that ones-complement and sign-magnitude computers exist.

    – user207421
    1 hour ago






3 Answers






Not quite an example of optimization, but one useful consequence of undefined behaviour is the -ftrapv command line switch of GCC/clang. It inserts code which crashes your program on integer overflow.



It won't work on unsigned integers, in accordance with the idea that unsigned overflow is intentional.



The Standard's wording on signed integer overflow ensures that people won't write overflowing code on purpose, so -ftrapv is a useful tool to discover unintentional overflow.

answered 6 hours ago by anatolyg
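
For illustration, a minimal sketch of what that looks like in practice (the file name is made up, and the exact failure mode, abort versus trap instruction, depends on the toolchain):

// trapv_demo.cpp (hypothetical name)
// Build with:  g++ -ftrapv trapv_demo.cpp   or   clang++ -ftrapv trapv_demo.cpp
#include <climits>
#include <cstdio>

int main()
{
    volatile int x = INT_MAX;  // volatile so the overflow is not folded away at compile time
    int y = x + 1;             // signed overflow: a -ftrapv build crashes here instead of wrapping
    std::printf("%d\n", y);    // only reached without -ftrapv; the overflow is then undefined behaviour
}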






    The answer is actually in your question:




    Yet most CPUs implement signed arithmetic with defined semantics




    I can't think of a CPU that you can buy today that does not use two's complement arithmetic for signed integers, but that wasn't always the case.



    The C language was invented in 1972. Back then, IBM 7090 mainframes still existed. Not all computers were two's complement.



    To have defined the language (and overflow behaviour) around two's complement would have been prejudicial to code generation on machines that weren't.



    Furthermore, as it has already been said, specifying that signed overflow is to be UB allows the compiler to produce better code, because it can discount code paths that result from signed overflow, assuming that this will never happen.



    If I understand correctly that it's intended to clamp the sum of a and b to 0...INT_MAX without wraparound, I can think of two ways to write this function in a compliant way.



    First, the inefficient general case that will work on all CPUs:



    int sum_max(int a, unsigned char b)
    {
        if (a > std::numeric_limits<int>::max() - b)
            return std::numeric_limits<int>::max();
        else
            return a + b;
    }


    Second, the surprisingly efficient two's-complement-specific way:



    int sum_max2(int a, unsigned char b)
    {
        unsigned int buffer;
        std::memcpy(&buffer, &a, sizeof(a));
        buffer += b;
        if (buffer > std::numeric_limits<int>::max())
            buffer = std::numeric_limits<int>::max();
        std::memcpy(&a, &buffer, sizeof(a));
        return a;
    }



    Resulting assembler can be seen here: https://godbolt.org/z/F42IXV

    answered 2 hours ago by Richard Hodges
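
    For illustration, a quick self-contained check of the clamping behaviour of the portable version (its definition is repeated so the snippet compiles on its own; the test values are arbitrary):

    #include <climits>
    #include <cstdio>
    #include <limits>

    // Portable version from the answer above, repeated for a standalone test.
    int sum_max(int a, unsigned char b)
    {
        if (a > std::numeric_limits<int>::max() - b)
            return std::numeric_limits<int>::max();
        else
            return a + b;
    }

    int main()
    {
        std::printf("%d\n", sum_max(10, 200));          // prints 210
        std::printf("%d\n", sum_max(INT_MAX - 5, 200)); // prints INT_MAX (clamped)
    }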






      I don't know about studies and statistics, but yes, there are definitely optimizations taking this into account that compilers actually do. And yes, they are very important (tl;dr: loop vectorization, for example).



      Besides the compiler optimizations, there is another aspect to be taken into account. With UB you get C/C++ signed integers to behave arithmetically as you would expect them to mathematically. For instance x + 10 > x holds true now, but would not with wrap-around behavior.



      I've found an excellent article, How undefined signed overflow enables optimizations in GCC from Krister Walfridsson’s blog, listing some optimizations that take signed overflow UB into account. The following examples are from it; I am adding C++ and assembly examples to them.



      If the optimizations look too simple, uninteresting or unimpactful, remember that these optimizations are just steps in a much, much larger chain of optimizations. And the butterfly effect does happen: a seemingly unimportant optimization at an earlier step can trigger a much more impactful optimization at a later step.



      If the examples look nonsensical (who would write x * 10 > 0?), keep in mind that you can very easily get to this kind of example in C and C++ with constants, macros and templates. Besides, the compiler can get to this kind of example when applying transformations and optimizations in its IR.



      Signed integer expression simplification




      • Eliminate multiplication in comparison with 0




        (x * c) cmp 0 -> x cmp 0 



        bool foo(int x) { return x * 10 > 0; }


        foo(int):
        test edi, edi
        setg al
        ret



      • Eliminate division after multiplication




        (x * c1) / c2 -> x * (c1 / c2) if c1 is divisible by c2




        int foo(int x) { return (x * 20) / 10; }


        foo(int):
        lea eax, [rdi+rdi]
        ret



      • Eliminate negation




        (-x) / (-y) -> x / y




        int foo(int x, int y) { return (-x) / (-y); }


        foo(int, int):
        mov eax, edi
        cdq
        idiv esi
        ret



      • Simplify comparisons that are always true or false




        x + c < x -> false
        x + c <= x -> false
        x + c > x -> true
        x + c >= x -> true



        bool foo(int x) { return x + 10 >= x; }


        foo(int):
        mov eax, 1
        ret



      • Eliminate negation in comparisons



        (-x) cmp (-y) -> y cmp x


        bool foo(int x, int y) { return -x < -y; }


        foo(int, int):
        cmp edi, esi
        setg al
        ret



      • Reduce magnitude of constants




        x + c > y -> x + (c - 1) >= y
        x + c <= y -> x + (c - 1) < y



        bool foo(int x, int y) { return x + 10 <= y; }


        foo(int, int):
        add edi, 9
        cmp edi, esi
        setl al
        ret



      • Eliminate constants in comparisons




        (x + c1) cmp c2 -> x cmp (c2 - c1)
        (x + c1) cmp (y + c2) -> x cmp (y + (c2 - c1)) if c1 <= c2


        The second transformation is only valid if c1 <= c2, as it would
        otherwise introduce an overflow when y has the value INT_MIN.




        bool foo(int x) { return x + 42 <= 11; }


        foo(int):
        cmp edi, -30
        setl al
        ret


      Pointer arithmetic and type promotion




      If an operation does not overflow, then we will get the same result if
      we do the operation in a wider type. This is often useful when doing
      things like array indexing on 64-bit architectures — the index
      calculations are typically done using 32-bit int, but the pointers are
      64-bit, and the compiler may generate more efficient code when signed
      overflow is undefined by promoting the 32-bit integers to 64-bit
      operations instead of generating type extensions.



      One other aspect of this is that undefined overflow ensures that a[i]
      and a[i+1] are adjacent. This improves analysis of memory accesses for
      vectorization etc.




      This is a very important optimization, as loop vectorization is one of the most efficient and effective optimization algorithms.



      It is trickier to demonstrate. But I remember actually encountering a situation where changing an index from unsigned to signed drastically improved the generated assembly. Unfortunately I cannot remember or replicate it now; I will come back later if I figure it out.
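
      In the meantime, here is a minimal sketch of the kind of indexing loop the quoted paragraph is about (the function is mine, not from the article): because i and i + 1 cannot overflow in a valid program, the compiler may do the index arithmetic directly in 64 bits and may treat a[i] and a[i + 1] as adjacent accesses, which helps vectorization.

      #include <cstdint>

      // Illustrative only: a 32-bit signed index used in 64-bit address
      // computations. With signed overflow undefined, the compiler may avoid
      // re-doing a sign extension of i and i + 1 on every iteration, and it
      // may treat the two loads as adjacent.
      std::int64_t adjacent_sum(const std::int32_t *a, int n)
      {
          std::int64_t s = 0;
          for (int i = 0; i + 1 < n; i++)
              s += static_cast<std::int64_t>(a[i]) + a[i + 1];
          return s;
      }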



      Value range calculations




      The compiler keeps track of the variables' range of possible values at
      each point in the program, i.e. for code such as



      int x = foo();
      if (x > 0) {
          int y = x + 5;
          int z = y / 4;
      }


      it determines that x has the range [1, INT_MAX] after the if-statement, and can thus determine that y has the range [6, INT_MAX] as overflow is not allowed. And the next line can be optimized to int z = y >> 2; as the compiler knows that y is non-negative.




      auto foo(int x)
      {
          if (x <= 0)
              __builtin_unreachable();

          return (x + 5) / 4;
      }



      foo(int):
      lea eax, [rdi+5]
      sar eax, 2
      ret



      The undefined overflow helps optimizations that need to compare two
      values (as the wrapping case would give possible values of the form
      [INT_MIN, (INT_MIN+4)] or [6, INT_MAX] that prevents all useful
      comparisons with < or >), such as



      • Changing comparisons x<y to true or false if the ranges for x and y do not overlap

      • Changing min(x,y) or max(x,y) to x or y if the ranges do not overlap

      • Changing abs(x) to x or -x if the range does not cross 0

      • Changing x/c to x>>log2(c) if x>0 and the constant c is a power of 2

      • Changing x%c to x&(c-1) if x>0 and the constant c is a power of 2



      Loop analysis and optimization




      The canonical example of why undefined signed overflow helps loop
      optimizations is that loops like



      for (int i = 0; i <= m; i++)


      are guaranteed to terminate for undefined overflow. This helps architectures that have specific loop instructions, as they in general do not handle infinite loops.



      But undefined signed overflow helps many more loop optimizations. All analyses, such as determining the number of iterations, transforming induction variables, and keeping track of memory accesses, use everything in the previous sections in order to do their work. In particular, the set of loops that can be vectorized is severely reduced when signed overflow is allowed.
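
      To make the termination point concrete, here is a small sketch contrasting a signed and an unsigned counter in the canonical loop above (the function names are mine, not from the article):

      // Illustrative only. With a signed counter the compiler may assume i
      // never wraps, so for m >= 0 the loop runs exactly m + 1 times and is
      // known to terminate. With an unsigned counter, m == UINT_MAX makes
      // the same loop infinite, and the compiler has to account for that.
      void fill_signed(int *a, int m)
      {
          for (int i = 0; i <= m; i++)      // trip count known: m + 1
              a[i] = i;
      }

      void fill_unsigned(int *a, unsigned m)
      {
          for (unsigned i = 0; i <= m; i++) // never terminates when m == UINT_MAX
              a[i] = static_cast<int>(i);
      }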







      • "x + 10 > x holds true now" - no it doesn't. It could format your hard disk if x happens to equal INT_MAX-5.

        – immibis
        18 mins ago











      Your Answer






      StackExchange.ifUsing("editor", function ()
      StackExchange.using("externalEditor", function ()
      StackExchange.using("snippets", function ()
      StackExchange.snippets.init();
      );
      );
      , "code-snippets");

      StackExchange.ready(function()
      var channelOptions =
      tags: "".split(" "),
      id: "1"
      ;
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function()
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled)
      StackExchange.using("snippets", function()
      createEditor();
      );

      else
      createEditor();

      );

      function createEditor()
      StackExchange.prepareEditor(
      heartbeatType: 'answer',
      autoActivateHeartbeat: false,
      convertImagesToLinks: true,
      noModals: true,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: 10,
      bindNavPrevention: true,
      postfix: "",
      imageUploader:
      brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
      contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
      allowUrls: true
      ,
      onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      );



      );













      draft saved

      draft discarded


















      StackExchange.ready(
      function ()
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f56047702%2fis-there-some-meaningful-statistical-data-to-justify-keeping-signed-integer-arit%23new-answer', 'question_page');

      );

      Post as a guest















      Required, but never shown

























      3 Answers
      3






      active

      oldest

      votes








      3 Answers
      3






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes









      4














      Not quite an example of optimization, but one useful consequence of undefined behaviour is -ftrapv command line switch of GCC/clang. It inserts code which crashes your program on integer overflow.



      It won't work on unsigned integers, in accordance with the idea that unsigned overflow is intentional.



      The Standard's wording on signed integer overflow ensures that people won't write overflowing code on purpose, so ftrapv is a useful tool to discover unintentional overflow.






      share|improve this answer



























        4














        Not quite an example of optimization, but one useful consequence of undefined behaviour is -ftrapv command line switch of GCC/clang. It inserts code which crashes your program on integer overflow.



        It won't work on unsigned integers, in accordance with the idea that unsigned overflow is intentional.



        The Standard's wording on signed integer overflow ensures that people won't write overflowing code on purpose, so ftrapv is a useful tool to discover unintentional overflow.






        share|improve this answer

























          4












          4








          4







          Not quite an example of optimization, but one useful consequence of undefined behaviour is -ftrapv command line switch of GCC/clang. It inserts code which crashes your program on integer overflow.



          It won't work on unsigned integers, in accordance with the idea that unsigned overflow is intentional.



          The Standard's wording on signed integer overflow ensures that people won't write overflowing code on purpose, so ftrapv is a useful tool to discover unintentional overflow.






          share|improve this answer













          Not quite an example of optimization, but one useful consequence of undefined behaviour is -ftrapv command line switch of GCC/clang. It inserts code which crashes your program on integer overflow.



          It won't work on unsigned integers, in accordance with the idea that unsigned overflow is intentional.



          The Standard's wording on signed integer overflow ensures that people won't write overflowing code on purpose, so ftrapv is a useful tool to discover unintentional overflow.







          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered 6 hours ago









          anatolyganatolyg

          17.4k44794




          17.4k44794























              0














              The answer is actually in your question:




              Yet most CPUs implement signed arithmetics with defined semantics




              I can't think of a CPU that you can buy today that does not use twos-compliment arithmetic for signed integers, but that wasn't always the case.



              The C language was invented in 1972. Back then, IBM 7090 mainframes still existed. Not all computers were twos-compliment.



              To have defined the language (and overflow behaviour) around 2s-compliment would have been prejudicial to code generation on machines that weren't.



              Furthermore, as it has already been said, specifying that signed overflow is to be UB allows the compiler to produce better code, because it can discount code paths that result from signed overflow, assuming that this will never happen.



              If I understand correctly that it's intended to clamp the sum of a and b to 0....INT_MAX without wraparound, I can think of two ways to write this function in a compliant way.



              First, the inefficient general case that will work on all cpus:



              int sum_max(int a, unsigned char b) 
              if (a > std::numeric_limits<int>::max() - b)
              return std::numeric_limits<int>::max();
              else
              return a + b;



              Second, the surprisingly efficient 2s-compliment specific way:



              int sum_max2(int a, unsigned char b) 
              unsigned int buffer;
              std::memcpy(&buffer, &a, sizeof(a));
              buffer += b;
              if (buffer > std::numeric_limits<int>::max())
              buffer = std::numeric_limits<int>::max();
              std::memcpy(&a, &buffer, sizeof(a));
              return a;



              Resulting assembler can be seen here: https://godbolt.org/z/F42IXV






              share|improve this answer



























                0














                The answer is actually in your question:




                Yet most CPUs implement signed arithmetics with defined semantics




                I can't think of a CPU that you can buy today that does not use twos-compliment arithmetic for signed integers, but that wasn't always the case.



                The C language was invented in 1972. Back then, IBM 7090 mainframes still existed. Not all computers were twos-compliment.



                To have defined the language (and overflow behaviour) around 2s-compliment would have been prejudicial to code generation on machines that weren't.



                Furthermore, as it has already been said, specifying that signed overflow is to be UB allows the compiler to produce better code, because it can discount code paths that result from signed overflow, assuming that this will never happen.



                If I understand correctly that it's intended to clamp the sum of a and b to 0....INT_MAX without wraparound, I can think of two ways to write this function in a compliant way.



                First, the inefficient general case that will work on all cpus:



                int sum_max(int a, unsigned char b) 
                if (a > std::numeric_limits<int>::max() - b)
                return std::numeric_limits<int>::max();
                else
                return a + b;



                Second, the surprisingly efficient 2s-compliment specific way:



                int sum_max2(int a, unsigned char b) 
                unsigned int buffer;
                std::memcpy(&buffer, &a, sizeof(a));
                buffer += b;
                if (buffer > std::numeric_limits<int>::max())
                buffer = std::numeric_limits<int>::max();
                std::memcpy(&a, &buffer, sizeof(a));
                return a;



                Resulting assembler can be seen here: https://godbolt.org/z/F42IXV






                share|improve this answer

























                  0












                  0








                  0







                  The answer is actually in your question:




                  Yet most CPUs implement signed arithmetics with defined semantics




                  I can't think of a CPU that you can buy today that does not use twos-compliment arithmetic for signed integers, but that wasn't always the case.



                  The C language was invented in 1972. Back then, IBM 7090 mainframes still existed. Not all computers were twos-compliment.



                  To have defined the language (and overflow behaviour) around 2s-compliment would have been prejudicial to code generation on machines that weren't.



                  Furthermore, as it has already been said, specifying that signed overflow is to be UB allows the compiler to produce better code, because it can discount code paths that result from signed overflow, assuming that this will never happen.



                  If I understand correctly that it's intended to clamp the sum of a and b to 0....INT_MAX without wraparound, I can think of two ways to write this function in a compliant way.



                  First, the inefficient general case that will work on all cpus:



                  int sum_max(int a, unsigned char b) 
                  if (a > std::numeric_limits<int>::max() - b)
                  return std::numeric_limits<int>::max();
                  else
                  return a + b;



                  Second, the surprisingly efficient 2s-compliment specific way:



                  int sum_max2(int a, unsigned char b) 
                  unsigned int buffer;
                  std::memcpy(&buffer, &a, sizeof(a));
                  buffer += b;
                  if (buffer > std::numeric_limits<int>::max())
                  buffer = std::numeric_limits<int>::max();
                  std::memcpy(&a, &buffer, sizeof(a));
                  return a;



                  Resulting assembler can be seen here: https://godbolt.org/z/F42IXV






                  share|improve this answer













                  The answer is actually in your question:




                  Yet most CPUs implement signed arithmetics with defined semantics




                  I can't think of a CPU that you can buy today that does not use twos-compliment arithmetic for signed integers, but that wasn't always the case.



                  The C language was invented in 1972. Back then, IBM 7090 mainframes still existed. Not all computers were twos-compliment.



                  To have defined the language (and overflow behaviour) around 2s-compliment would have been prejudicial to code generation on machines that weren't.



                  Furthermore, as it has already been said, specifying that signed overflow is to be UB allows the compiler to produce better code, because it can discount code paths that result from signed overflow, assuming that this will never happen.



                  If I understand correctly that it's intended to clamp the sum of a and b to 0....INT_MAX without wraparound, I can think of two ways to write this function in a compliant way.



                  First, the inefficient general case that will work on all cpus:



                  int sum_max(int a, unsigned char b) 
                  if (a > std::numeric_limits<int>::max() - b)
                  return std::numeric_limits<int>::max();
                  else
                  return a + b;



                  Second, the surprisingly efficient 2s-compliment specific way:



                  int sum_max2(int a, unsigned char b) 
                  unsigned int buffer;
                  std::memcpy(&buffer, &a, sizeof(a));
                  buffer += b;
                  if (buffer > std::numeric_limits<int>::max())
                  buffer = std::numeric_limits<int>::max();
                  std::memcpy(&a, &buffer, sizeof(a));
                  return a;



                  Resulting assembler can be seen here: https://godbolt.org/z/F42IXV







                  share|improve this answer












                  share|improve this answer



                  share|improve this answer










                  answered 2 hours ago









                  Richard HodgesRichard Hodges

                  57.4k658105




                  57.4k658105





















                      0














                      I don't know about studies and statistics, but yes, there are definitely optimizations taking this into account that compilers actually do. And yes, they are very important (tldr loop vectorization for example).



                      Besides the compiler optimizations, there is another aspect to be taken into account. With UB you get C/C++ signed integers to behave arithmetically as you would expect mathematically. For instance x + 10 > x holds true now, but would not on a wrap-around behavior.



                      I've found an excellent article How undefined signed overflow enables optimizations in GCC from Krister Walfridsson’s blog listing some optimizations that take signed overflow UB into account. The following examples are from it. I am adding c++ and assembly examples to them.



                      If the optimizations look too simple, uninteresting or unimpactful, remember that these optimization are just steps in a much much larger chain of optimizations. And the butterfly effect does happen as a seemingly unimportant optimization at an earlier step can trigger a much more impactful optimization at a later step.



                      If the examples look nonsensical (who would write x * 10 > 0) keep in mind that you can very easily get to this kind of examples in C and C++ with constants, macros, templates. Besides the compiler can get to this kind of examples when applying transformations and optimizations in its IR.



                      Signed integer expression simplification




                      • Eliminate multiplication in comparison with 0




                        (x * c) cmp 0 -> x cmp 0 



                        bool foo(int x) return x * 10 > 0 


                        foo(int):
                        test edi, edi
                        setg al
                        ret



                      • Eliminate division after multiplication




                        (x * c1) / c2 -> x * (c1 / c2) if c1 is divisible by c2




                        int foo(int x) return (x * 20) / 10; 


                        foo(int):
                        lea eax, [rdi+rdi]
                        ret



                      • Eliminate negation




                        (-x) / (-y) -> x / y




                        int foo(int x, int y) return (-x) / (-y); 


                        foo(int, int):
                        mov eax, edi
                        cdq
                        idiv esi
                        ret



                      • Simplify comparisons that are always true or false




                        x + c < x -> false
                        x + c <= x -> false
                        x + c > x -> true
                        x + c >= x -> true



                        bool foo(int x) return x + 10 >= x; 


                        foo(int):
                        mov eax, 1
                        ret



                      • Eliminate negation in comparisons



                        (-x) cmp (-y) -> y cmp x


                        bool foo(int x, int y) return -x < -y; 


                        foo(int, int):
                        cmp edi, esi
                        setg al
                        ret



                      • Reduce magnitude of constants




                        x + c > y -> x + (c - 1) >= y
                        x + c <= y -> x + (c - 1) < y



                        bool foo(int x, int y) return x + 10 <= y; 


                        foo(int, int):
                        add edi, 9
                        cmp edi, esi
                        setl al
                        ret



                      • Eliminate constants in comparisons




                        (x + c1) cmp c2 -> x cmp (c2 - c1)
                        (x + c1) cmp (y + c2) -> x cmp (y + (c2 - c1)) if c1 <= c2


                        The second transformation is only valid if c1 <= c2, as it would
                        otherwise introduce an overflow when y has the value INT_MIN.




                        bool foo(int x) return x + 42 <= 11; 


                        foo(int):
                        cmp edi, -30
                        setl al
                        ret


                      Pointer arithmetic and type promotion




                      If an operation does not overflow, then we will get the same result if
                      we do the operation in a wider type. This is often useful when doing
                      things like array indexing on 64-bit architectures — the index
                      calculations are typically done using 32-bit int, but the pointers are
                      64-bit, and the compiler may generate more efficient code when signed
                      overflow is undefined by promoting the 32-bit integers to 64-bit
                      operations instead of generating type extensions.



                      One other aspect of this is that undefined overflow ensures that a[i]
                      and a[i+1] are adjacent. This improves analysis of memory accesses for
                      vectorization etc.




                      This is a very important optimization as loop vectorization one of the most efficient and effective optimization algorithms.



                      It is trickier to demonstrate. But I remember actually encountering a situation when changing an index from unsigned to signed drastically improved the generated assembly. Unfortunately I cannot remember or replicate it now. Will come back later if I figure it out.



                      Value range calculations




                      The compiler keeps track of the variables' range of possible values at
                      each point in the program, i.e. for code such as



                      int x = foo();
                      if (x > 0) {
                      int y = x + 5;
                      int z = y / 4;


                      it determines that x has the range [1, INT_MAX] after the
                      if-statement, and can thus determine that y has the range [6,
                      INT_MAX]
                      as overflow is not allowed. And the next line can be
                      optimized to int z = y >> 2; as the compiler knows that y is
                      non-negative.




                      auto foo(int x)

                      if (x <= 0)
                      __builtin_unreachable();

                      return (x + 5) / 4;



                      foo(int):
                      lea eax, [rdi+5]
                      sar eax, 2
                      ret



                      The undefined overflow helps optimizations that need to compare two
                      values (as the wrapping case would give possible values of the form
                      [INT_MIN, (INT_MIN+4)] or [6, INT_MAX] that prevents all useful
                      comparisons with < or >), such as



                      • Changing comparisons x<y to true or false if the ranges for x and y does not overlap

                      • Changing min(x,y) or max(x,y) to x or y if the ranges do not overlap

                      • Changing abs(x) to x or -x if the range does not cross 0

                      • Changing x/c to x>>log2(c) if x>0 and the constant c is a power of 2

                      • Changing x%c to x&(c-1) if x>0 and the constant c is a power of 2



                      Loop analysis and optimization




                      The canonical example of why undefined signed overflow helps loop
                      optimizations is that loops like



                      for (int i = 0; i <= m; i++)


                      are guaranteed to terminate for undefined overflow. This helps
                      architectures that have specific loop instructions, as they do in
                      general not handle infinite loops.



                      But undefined signed overflow helps many more loop optimizations. All
                      analysis such as determining number of iteration, transforming
                      induction variables, and keeping track of memory accesses are using
                      everything in the previous sections in order to do its work. In
                      particular, the set of loops that can be vectorized are severely
                      reduced when signed overflow is allowed
                      .







                      share|improve this answer

























                      • "x + 10 > x holds true now" - no it doesn't. It could format your hard disk if x happens to equal INT_MAX-5.

                        – immibis
                        18 mins ago















                      0














                      I don't know about studies and statistics, but yes, there are definitely optimizations taking this into account that compilers actually do. And yes, they are very important (tldr loop vectorization for example).



                      Besides the compiler optimizations, there is another aspect to be taken into account. With UB you get C/C++ signed integers to behave arithmetically as you would expect mathematically. For instance x + 10 > x holds true now, but would not on a wrap-around behavior.



                      I've found an excellent article How undefined signed overflow enables optimizations in GCC from Krister Walfridsson’s blog listing some optimizations that take signed overflow UB into account. The following examples are from it. I am adding c++ and assembly examples to them.



                      If the optimizations look too simple, uninteresting or unimpactful, remember that these optimization are just steps in a much much larger chain of optimizations. And the butterfly effect does happen as a seemingly unimportant optimization at an earlier step can trigger a much more impactful optimization at a later step.



                      If the examples look nonsensical (who would write x * 10 > 0) keep in mind that you can very easily get to this kind of examples in C and C++ with constants, macros, templates. Besides the compiler can get to this kind of examples when applying transformations and optimizations in its IR.



                      Signed integer expression simplification




                      • Eliminate multiplication in comparison with 0




                        (x * c) cmp 0 -> x cmp 0 



                        bool foo(int x) return x * 10 > 0 


                        foo(int):
                        test edi, edi
                        setg al
                        ret



                      • Eliminate division after multiplication




                        (x * c1) / c2 -> x * (c1 / c2) if c1 is divisible by c2




                        int foo(int x) return (x * 20) / 10; 


                        foo(int):
                        lea eax, [rdi+rdi]
                        ret



                      • Eliminate negation




                        (-x) / (-y) -> x / y




                        int foo(int x, int y) return (-x) / (-y); 


                        foo(int, int):
                        mov eax, edi
                        cdq
                        idiv esi
                        ret



                      • Simplify comparisons that are always true or false




                        x + c < x -> false
                        x + c <= x -> false
                        x + c > x -> true
                        x + c >= x -> true



                        bool foo(int x) return x + 10 >= x; 


                        foo(int):
                        mov eax, 1
                        ret



                      • Eliminate negation in comparisons



                        (-x) cmp (-y) -> y cmp x


                        bool foo(int x, int y) return -x < -y; 


                        foo(int, int):
                        cmp edi, esi
                        setg al
                        ret



                      • Reduce magnitude of constants




                        x + c > y -> x + (c - 1) >= y
                        x + c <= y -> x + (c - 1) < y



                        bool foo(int x, int y) return x + 10 <= y; 


                        foo(int, int):
                        add edi, 9
                        cmp edi, esi
                        setl al
                        ret



                      • Eliminate constants in comparisons




                        (x + c1) cmp c2 -> x cmp (c2 - c1)
                        (x + c1) cmp (y + c2) -> x cmp (y + (c2 - c1)) if c1 <= c2


                        The second transformation is only valid if c1 <= c2, as it would
                        otherwise introduce an overflow when y has the value INT_MIN.




                        bool foo(int x) return x + 42 <= 11; 


                        foo(int):
                        cmp edi, -30
                        setl al
                        ret


                      Pointer arithmetic and type promotion




                      If an operation does not overflow, then we will get the same result if
                      we do the operation in a wider type. This is often useful when doing
                      things like array indexing on 64-bit architectures — the index
                      calculations are typically done using 32-bit int, but the pointers are
                      64-bit, and the compiler may generate more efficient code when signed
                      overflow is undefined by promoting the 32-bit integers to 64-bit
                      operations instead of generating type extensions.



                      One other aspect of this is that undefined overflow ensures that a[i]
                      and a[i+1] are adjacent. This improves analysis of memory accesses for
                      vectorization etc.




                      This is a very important optimization as loop vectorization one of the most efficient and effective optimization algorithms.



                      It is trickier to demonstrate. But I remember actually encountering a situation when changing an index from unsigned to signed drastically improved the generated assembly. Unfortunately I cannot remember or replicate it now. Will come back later if I figure it out.



                      Value range calculations




                      The compiler keeps track of the variables' range of possible values at
                      each point in the program, i.e. for code such as



                      int x = foo();
                      if (x > 0) {
                      int y = x + 5;
                      int z = y / 4;


                      it determines that x has the range [1, INT_MAX] after the
                      if-statement, and can thus determine that y has the range [6,
                      INT_MAX]
                      as overflow is not allowed. And the next line can be
                      optimized to int z = y >> 2; as the compiler knows that y is
                      non-negative.




                      auto foo(int x)

                      if (x <= 0)
                      __builtin_unreachable();

                      return (x + 5) / 4;



                      foo(int):
                      lea eax, [rdi+5]
                      sar eax, 2
                      ret



                      The undefined overflow helps optimizations that need to compare two
                      values (as the wrapping case would give possible values of the form
                      [INT_MIN, (INT_MIN+4)] or [6, INT_MAX] that prevents all useful
                      comparisons with < or >), such as



                      • Changing comparisons x<y to true or false if the ranges for x and y does not overlap

                      • Changing min(x,y) or max(x,y) to x or y if the ranges do not overlap

                      • Changing abs(x) to x or -x if the range does not cross 0

                      • Changing x/c to x>>log2(c) if x>0 and the constant c is a power of 2

                      • Changing x%c to x&(c-1) if x>0 and the constant c is a power of 2



                      Loop analysis and optimization




                      The canonical example of why undefined signed overflow helps loop
                      optimizations is that loops like



                      for (int i = 0; i <= m; i++)


                      are guaranteed to terminate for undefined overflow. This helps
                      architectures that have specific loop instructions, as they do in
                      general not handle infinite loops.



                      But undefined signed overflow helps many more loop optimizations. All
                      analysis such as determining number of iteration, transforming
                      induction variables, and keeping track of memory accesses are using
                      everything in the previous sections in order to do its work. In
                      particular, the set of loops that can be vectorized are severely
                      reduced when signed overflow is allowed
                      .







                      share|improve this answer

























                      • "x + 10 > x holds true now" - no it doesn't. It could format your hard disk if x happens to equal INT_MAX-5.

                        – immibis
                        18 mins ago













                      0












                      0








                      0







                      I don't know about studies and statistics, but yes, there are definitely optimizations taking this into account that compilers actually do. And yes, they are very important (tldr loop vectorization for example).



                      Besides the compiler optimizations, there is another aspect to be taken into account. With UB you get C/C++ signed integers to behave arithmetically as you would expect mathematically. For instance x + 10 > x holds true now, but would not on a wrap-around behavior.



                      I've found an excellent article How undefined signed overflow enables optimizations in GCC from Krister Walfridsson’s blog listing some optimizations that take signed overflow UB into account. The following examples are from it. I am adding c++ and assembly examples to them.



                      If the optimizations look too simple, uninteresting or unimpactful, remember that these optimization are just steps in a much much larger chain of optimizations. And the butterfly effect does happen as a seemingly unimportant optimization at an earlier step can trigger a much more impactful optimization at a later step.



                      If the examples look nonsensical (who would write x * 10 > 0) keep in mind that you can very easily get to this kind of examples in C and C++ with constants, macros, templates. Besides the compiler can get to this kind of examples when applying transformations and optimizations in its IR.



                      Signed integer expression simplification




                      • Eliminate multiplication in comparison with 0




                        (x * c) cmp 0 -> x cmp 0 



                        bool foo(int x) return x * 10 > 0 


                        foo(int):
                        test edi, edi
                        setg al
                        ret



                      • Eliminate division after multiplication




                        (x * c1) / c2 -> x * (c1 / c2) if c1 is divisible by c2




                        int foo(int x) return (x * 20) / 10; 


                        foo(int):
                        lea eax, [rdi+rdi]
                        ret



                      • Eliminate negation




                        (-x) / (-y) -> x / y




                        int foo(int x, int y) return (-x) / (-y); 


                        foo(int, int):
                        mov eax, edi
                        cdq
                        idiv esi
                        ret



                      • Simplify comparisons that are always true or false




                        x + c < x -> false
                        x + c <= x -> false
                        x + c > x -> true
                        x + c >= x -> true



                        bool foo(int x) return x + 10 >= x; 


                        foo(int):
                        mov eax, 1
                        ret



                      • Eliminate negation in comparisons



                        (-x) cmp (-y) -> y cmp x


                        bool foo(int x, int y) return -x < -y; 


                        foo(int, int):
                        cmp edi, esi
                        setg al
                        ret



                      • Reduce magnitude of constants




                        x + c > y -> x + (c - 1) >= y
                        x + c <= y -> x + (c - 1) < y



bool foo(int x, int y) { return x + 10 <= y; }


                        foo(int, int):
                        add edi, 9
                        cmp edi, esi
                        setl al
                        ret



                      • Eliminate constants in comparisons




                        (x + c1) cmp c2 -> x cmp (c2 - c1)
                        (x + c1) cmp (y + c2) -> x cmp (y + (c2 - c1)) if c1 <= c2


                        The second transformation is only valid if c1 <= c2, as it would
                        otherwise introduce an overflow when y has the value INT_MIN.




bool foo(int x) { return x + 42 <= 11; }


                        foo(int):
                        cmp edi, -30
                        setl al
                        ret


                      Pointer arithmetic and type promotion




                      If an operation does not overflow, then we will get the same result if
                      we do the operation in a wider type. This is often useful when doing
                      things like array indexing on 64-bit architectures — the index
                      calculations are typically done using 32-bit int, but the pointers are
                      64-bit, and the compiler may generate more efficient code when signed
                      overflow is undefined by promoting the 32-bit integers to 64-bit
                      operations instead of generating type extensions.
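A minimal sketch of the promotion point (my example, not from the article; the exact code generation depends on the compiler and target):

// Sketch: with signed 32-bit indices the compiler is allowed to evaluate
// i + j directly in a 64-bit register (no wrap assumed) and feed it into
// the 64-bit addressing mode, instead of computing a 32-bit sum and then
// sign-extending it.
int load(const int* p, int i, int j) {
    return p[i + j];
}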



                      One other aspect of this is that undefined overflow ensures that a[i]
                      and a[i+1] are adjacent. This improves analysis of memory accesses for
                      vectorization etc.




This matters a lot in practice, as loop vectorization is one of the most effective optimizations a compiler performs.



It is trickier to demonstrate, but I remember actually encountering a situation where changing an index from unsigned to signed drastically improved the generated assembly. Unfortunately I cannot remember or replicate it now; I will come back later if I figure it out. A small sketch of the adjacency point is below.
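A small sketch of the adjacency point (my construction, not from the article; the exact output varies by compiler and options):

// Sketch: with a signed index, a[i] and a[i + 1] are guaranteed adjacent
// (i + 1 cannot wrap), so the two stores may be merged into one wider store.
// With an unsigned index, i == UINT_MAX would make a[i + 1] refer to a[0],
// so the compiler cannot assume adjacency in the same way.
void two_signed(int* a, int i) {
    a[i] = 1;
    a[i + 1] = 2;
}

void two_unsigned(int* a, unsigned i) {
    a[i] = 1;
    a[i + 1] = 2;     // i + 1 wraps to 0 when i == UINT_MAX
}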



                      Value range calculations




The compiler keeps track of the variables' range of possible values at
each point in the program, e.g. for code such as

int x = foo();
if (x > 0) {
    int y = x + 5;
    int z = y / 4;
}

it determines that x has the range [1, INT_MAX] after the if-condition,
and can thus determine that y has the range [6, INT_MAX], as overflow is
not allowed. The next line can then be optimized to int z = y >> 2; as
the compiler knows that y is non-negative.




auto foo(int x) {
    if (x <= 0)
        __builtin_unreachable();

    return (x + 5) / 4;
}



                      foo(int):
                      lea eax, [rdi+5]
                      sar eax, 2
                      ret



The undefined overflow helps optimizations that need to compare two
values (as the wrapping case would give possible values of the form
[INT_MIN, (INT_MIN + 4)] or [6, INT_MAX], which prevents all useful
comparisons with < or >), such as



• Changing comparisons x<y to true or false if the ranges for x and y do not overlap

                      • Changing min(x,y) or max(x,y) to x or y if the ranges do not overlap

                      • Changing abs(x) to x or -x if the range does not cross 0

                      • Changing x/c to x>>log2(c) if x>0 and the constant c is a power of 2

• Changing x%c to x&(c-1) if x>0 and the constant c is a power of 2 (see the sketch after this list)
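A minimal sketch of those last two items (my example, not from the article), reusing the __builtin_unreachable trick from above:

// Sketch: __builtin_unreachable() (GCC/Clang) tells the compiler x > 0.
// Since x + 7 is assumed not to wrap, y is known to be strictly positive,
// so the division and modulo by 8 may be lowered to a shift and a mask.
int bucket(int x) {
    if (x <= 0)
        __builtin_unreachable();
    int y = x + 7;
    return y / 8 + y % 8;      // candidate for (y >> 3) + (y & 7)
}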



                      Loop analysis and optimization




                      The canonical example of why undefined signed overflow helps loop
                      optimizations is that loops like



                      for (int i = 0; i <= m; i++)


are guaranteed to terminate when overflow is undefined. This helps
architectures that have specific loop instructions, as those in general
do not handle infinite loops.
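A hedged sketch of the termination argument (my example, not from the article):

// Sketch: with a signed counter the compiler may treat the loop as running
// exactly m + 1 times (for m >= 0), because i++ overflowing is undefined;
// that known trip count is what counted-loop instructions, unrolling and
// vectorization need. Under wrap-around semantics, m == INT_MAX would make
// i <= m always true and the loop infinite, so no trip count would exist.
void zero_prefix(float* a, int m) {
    for (int i = 0; i <= m; i++)
        a[i] = 0.0f;
}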



But undefined signed overflow helps many more loop optimizations. All
analyses, such as determining the number of iterations, transforming
induction variables, and keeping track of memory accesses, use
everything from the previous sections in order to do their work. In
particular, the set of loops that can be vectorized is severely reduced
when signed overflow is allowed to wrap.
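To tie the pieces together, a hedged sketch of my own (not from the article): the properties discussed above — a known trip count, a non-wrapping induction variable, and provably adjacent a[i] accesses — are exactly what lets the vectorizer turn a loop like this into SIMD code when optimizations are enabled (e.g. -O3).

// Sketch: i is a signed, non-wrapping induction variable, the trip count is
// n, and y[i]/x[i] walk forward linearly, so this loop is a straightforward
// candidate for SIMD code generation.
void saxpy(float* y, const float* x, float a, int n) {
    for (int i = 0; i < n; ++i)
        y[i] += a * x[i];
}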







answered 1 hour ago, edited 59 mins ago, by bolov (34.4k)












                      • "x + 10 > x holds true now" - no it doesn't. It could format your hard disk if x happens to equal INT_MAX-5.

                        – immibis
                        18 mins ago

















                      • "x + 10 > x holds true now" - no it doesn't. It could format your hard disk if x happens to equal INT_MAX-5.

                        – immibis
                        18 mins ago
















                      "x + 10 > x holds true now" - no it doesn't. It could format your hard disk if x happens to equal INT_MAX-5.

                      – immibis
                      18 mins ago





                      "x + 10 > x holds true now" - no it doesn't. It could format your hard disk if x happens to equal INT_MAX-5.

                      – immibis
                      18 mins ago

















                      draft saved

                      draft discarded
















































                      Thanks for contributing an answer to Stack Overflow!


                      • Please be sure to answer the question. Provide details and share your research!

                      But avoid


                      • Asking for help, clarification, or responding to other answers.

                      • Making statements based on opinion; back them up with references or personal experience.

                      To learn more, see our tips on writing great answers.




                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function ()
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f56047702%2fis-there-some-meaningful-statistical-data-to-justify-keeping-signed-integer-arit%23new-answer', 'question_page');

                      );

                      Post as a guest















                      Required, but never shown





















































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown

































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown







                      Popular posts from this blog

                      Log på Navigationsmenu

                      Creating second map without labels using QGIS?How to lock map labels for inset map in Print Composer?How to Force the Showing of Labels of a Vector File in QGISQGIS Valmiera, Labels only show for part of polygonsRemoving duplicate point labels in QGISLabeling every feature using QGIS?Show labels for point features outside map canvasAbbreviate Road Labels in QGIS only when requiredExporting map from composer in QGIS - text labels have moved in output?How to make sure labels in qgis turn up in layout map?Writing label expression with ArcMap and If then Statement?

                      Detroit Tigers Spis treści Historia | Skład zespołu | Sukcesy | Członkowie Baseball Hall of Fame | Zastrzeżone numery | Przypisy | Menu nawigacyjneEncyclopedia of Detroit - Detroit TigersTigers Stadium, Detroit, MITigers Timeline 1900sDetroit Tigers Team History & EncyclopediaTigers Timeline 1910s1935 World Series1945 World Series1945 World Series1984 World SeriesComerica Park, Detroit, MI2006 World Series2012 World SeriesDetroit Tigers 40-Man RosterDetroit Tigers Coaching StaffTigers Hall of FamersTigers Retired Numberse