Avoid buffered I/O #4

RinHizakura · 2021-06-08T06:42:19Z

Here's a macro-based solution to avoid unexpected output from the buffered I/O of printf. It has some limitations and needs improvement, but it achieves the goal of emitting output as whole text lines, as we want.

fiber/fiber.c

jserv · 2021-06-08T06:45:11Z

@@ -119,13 +119,24 @@ int fiber_wait_all()
    return FIBER_NOERROR;
 }

+/* A simple solution to avoid unexpected output by the buffered I/O printf is


Please explain why this wrapper is "safe."

Actually, I am thinking of renaming the macro. I used the word "safe" to mean that the output appears the way we want. But as far as I know, the Linux manual describes printf as thread-safe, which means it won't corrupt anything when used from multiple threads. However, interleaving that doesn't change the outcome is still allowed to happen, and that can mix the output of different threads.

So I am not sure whether the name print_safe will mislead someone into thinking that printf is not thread-safe. Am I misunderstanding something, or do you have any advice on this?

It was not meant to be "safe." Instead, its intention is to eliminate the effect of buffered I/O, which causes strings to appear in an unexpected order.

While the printf provided by glibc is a buffered output function, the proposed change performs "direct" formatted output. Therefore, you can rename printf_safe to printf_unbuffered and then use #define printf printf_unbuffered for build-time substitution.

Reference: CS:APP Chapter 10
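The renaming suggestion above could look roughly like the following. This is only a sketch under assumptions, not the code that was merged: the helper name fd_printf_unbuffered, the UNBUF_SIZE constant, and the choice to truncate long output are all hypothetical. It formats into a stack buffer with vsnprintf and passes the bytes to write in a single call, so nothing is held in the stdio buffer.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <unistd.h>

#define UNBUF_SIZE 64 /* assumed buffer size; longer output is truncated */

/* Format into a stack buffer, then hand the bytes to write(2) directly,
 * bypassing the stdio buffer. Returns the number of bytes written, or -1
 * on error. (Hypothetical helper, not taken from the pull request.) */
static int fd_printf_unbuffered(int fd, const char *fmt, ...)
{
    assert(fmt != NULL);

    char buf[UNBUF_SIZE];
    va_list ap;
    va_start(ap, fmt);
    int len = vsnprintf(buf, sizeof(buf), fmt, ap);
    va_end(ap);

    if (len < 0)
        return -1;
    if ((size_t) len >= sizeof(buf))
        len = (int) sizeof(buf) - 1; /* vsnprintf truncated the output */

    /* One write(2) call; a short write is not retried in this sketch. */
    return (int) write(fd, buf, (size_t) len);
}

#define printf_unbuffered(...) \
    fd_printf_unbuffered(STDOUT_FILENO, __VA_ARGS__)

/* Build-time substitution, as suggested in the review:
 *     #define printf printf_unbuffered
 */
```

With the substitution enabled, existing printf("...") calls in the file expand to printf_unbuffered(...) without further changes. Taking the file descriptor as a parameter is a design choice made here only so the helper can be exercised against a pipe.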

fiber/fiber.c

jserv · 2021-06-09T14:39:06Z

+/* A simple solution to avoid unexpected output by the buffered I/O 'printf' is
+ * using low-level I/O interface 'write'. It works because 'printf' will wait to
+ * write STDOUT until the buffer is full, or on some other conditions. Using
+ * write, however, can write STDOUT immediately.


Change "write STDOUT" to "write to STDOUT"

RinHizakura · 2021-06-09T15:22:19Z

When I dug deeper into my solution for the interleaved output, I realized I may have had some mistaken ideas when designing this macro. I first chose this approach because I thought write would not be interrupted during execution, except by a signal handler, so it could output the whole given string immediately without interleaving. But after some exploration and thought, I wonder whether I have to handle the possibility that write may write fewer than the requested n bytes.

In my results the output does appear as expected, but I'm not sure whether this really solves the problem. I may need time to clarify my own question.
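For reference, the usual way to handle a short write is a retry loop. The sketch below is an illustration only; the name write_all is hypothetical and this is not code from the pull request. It keeps calling write until every byte has been accepted, and retries when the call is interrupted by a signal before writing anything.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <unistd.h>

/* Keep calling write(2) until all n bytes have been written.
 * Returns 0 on success, -1 on error (errno is left as set by write).
 * (Hypothetical helper, not taken from the pull request.) */
static int write_all(int fd, const void *data, size_t n)
{
    assert(data != NULL || n == 0);

    const char *p = data;
    while (n > 0) {
        ssize_t w = write(fd, p, n);
        if (w < 0) {
            if (errno == EINTR)
                continue; /* interrupted before any byte was written */
            return -1;
        }
        p += w;          /* skip the bytes already written */
        n -= (size_t) w; /* and retry with the remainder */
    }
    return 0;
}
```

Note the trade-off: once a message is split across several write calls, output from another thread or process could land between the pieces, so the loop guarantees completeness rather than freedom from interleaving.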

A simple solution to avoid unexpected output from the buffered I/O function 'printf' is to use the low-level I/O interface 'write'. It works because 'printf' waits to write to STDOUT until the buffer is full, or until some other condition is met. 'write', however, can write to STDOUT immediately.

Here is a naive implementation of the idea, with some limitations and weaknesses that need improvement:
1. It will fail if the formatted string has length >64.
2. The function 'write' can write fewer than n bytes. This needs further handling if it happens.
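One possible way to lift the first limitation is to measure the formatted length before allocating the buffer. The following is a sketch under assumptions: the name format_alloc is hypothetical, and it presumes that a heap allocation per message is acceptable in the calling context, which the pull request does not state. It calls vsnprintf once with a NULL buffer to obtain the required length, then formats into a buffer of exactly that size.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>

/* Format into a heap buffer sized to fit the whole result.
 * On success, returns the buffer (the caller frees it) and stores the
 * length, excluding the terminating NUL, in *out_len. Returns NULL on
 * failure. (Hypothetical helper, not taken from the pull request.) */
static char *format_alloc(size_t *out_len, const char *fmt, ...)
{
    assert(out_len != NULL && fmt != NULL);

    va_list ap, ap2;
    va_start(ap, fmt);
    va_copy(ap2, ap); /* the argument list is consumed twice */

    /* First pass: a NULL buffer of size 0 only reports the length. */
    int len = vsnprintf(NULL, 0, fmt, ap);
    va_end(ap);
    if (len < 0) {
        va_end(ap2);
        return NULL;
    }

    /* Second pass: format into a buffer of exactly the needed size. */
    char *buf = malloc((size_t) len + 1);
    if (buf)
        vsnprintf(buf, (size_t) len + 1, fmt, ap2);
    va_end(ap2);

    if (!buf)
        return NULL;
    *out_len = (size_t) len;
    return buf;
}
```

The returned buffer and length can then be passed to write, and the buffer released with free afterwards.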

Currently, the reader_func is counting the number of grace periods based on the value of dut, which is updated by mthpc_rcu_replace_pointer. However, the value of dut actually represents the time we update the value, not the number of grace periods. Also, the original method might result in incorrect counting if someone tried to update the gp_idx while others who saw the same dut value with prev_count still depend on the old gp_idx to increase the counter.

To fix the problem, instead of relying on the dut value to increase the gp_idx, we manually increase gp_idx on write side. Then, we can easily determine the gp on read side. For dut value, we simply check the old count value is not greater than the newest one.

Additionally, since synchronize_rcu is quite slow, readers generally will pass through the critical section during the first grace period. To generate more realistic output, we add a delay on read side before entering the critical section.

Before:
100 reader(s), 5 update run(s), 6 grace period(s)
[grace period #0] 100 reader(s)
[grace period #1] 0 reader(s)
[grace period #2] 0 reader(s)
[grace period #3] 0 reader(s)
[grace period #4] 0 reader(s)
[grace period #5] 0 reader(s)

After, we added a delay:
100 reader(s), 5 update run(s), 6 grace period(s)
[grace period #0] 76 reader(s)
[grace period #1] 0 reader(s)
[grace period #2] 1 reader(s)
[grace period #3] 0 reader(s)
[grace period #4] 3 reader(s)
[grace period #5] 20 reader(s)

jserv reviewed Jun 8, 2021

jserv reviewed Jun 8, 2021

jserv reviewed Jun 8, 2021

RinHizakura force-pushed the pr_fiber branch 3 times, most recently from c949298 to 521032e, on June 9, 2021 14:27

jserv reviewed Jun 9, 2021

RinHizakura force-pushed the pr_fiber branch from 521032e to e96ec90 on June 9, 2021 15:23

jserv changed the title ~~Add safe print for fiber~~ Avoid buffered I/O Jun 9, 2021

RinHizakura force-pushed the pr_fiber branch 2 times, most recently from fd082b4 to 4f4031a, on June 9, 2021 16:44

RinHizakura force-pushed the pr_fiber branch from 4f4031a to 7226451 on June 9, 2021 16:45

jserv merged commit 187c5aa into sysprog21:master Jun 9, 2021

RinHizakura deleted the pr_fiber branch August 7, 2021 19:19
