nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2025-05-14 02:49:40 +00:00

Author	SHA1	Message	Date
Lukas Kalbertodt	8a18fb0f73	Stabilize `Seek::stream_position` & change feature of `Seek::stream_len`	2021-01-24 10:14:24 +01:00
Yuki Okushi	9abd746a32	Rollup merge of #80172 - camelid:prelude-docs-consistent-punct, r=steveklabnik Use consistent punctuation for 'Prelude contents' docs	2021-01-21 20:04:39 +09:00
bors	cf04ae54e6	Auto merge of #79705 - ijackson:bufwriter-disassemble, r=m-ou-se BufWriter: Provide into_raw_parts If something goes wrong, one might want to unpeel the layers of nested Writers to perform recovery actions on the underlying writer, or reuse its resources. `into_inner` can be used for this when the inner writer is still working. But when the inner writer is broken, and returning errors, `into_inner` simply gives you the error from flush, and the same `Bufwriter` back again. Here I provide the necessary function, which I have chosen to call `into_raw_parts`. I had to do something with `panicked`. Returning it to the caller as a boolean seemed rather bare. Throwing the buffered data away in this situation also seems unfriendly: maybe the programmer knows something about the underlying writer and can recover somehow. So I went for a custom Error. This may be overkill, but it does have the nice property that a caller who actually wants to look at the buffered data, rather than simply extracting the inner writer, will be told by the type system if they forget to handle the panicked case. If a caller doesn't need the buffer, it can just be discarded. That WriterPanicked is a newtype around Vec<u8> means that hopefully the layouts of the Ok and Err variants can be very similar, with just a boolean discriminant. So this custom error type should compile down to nearly no code. If this general idea is felt appropriate, I will open a tracking issue, etc.	2021-01-19 16:42:19 +00:00
Ben Kimock	4e27ed3af1	Add benchmark and fast path for BufReader::read_exact	2021-01-17 12:10:39 +10:00
Mara Bos	ce48709405	Rollup merge of #80895 - sfackler:read-to-end-ub, r=m-ou-se Fix handling of malicious Readers in read_to_end A malicious `Read` impl could return overly large values from `read`, which would result in the guard's drop impl setting the buffer's length to greater than its capacity! ~~To fix this, the drop impl now uses the safe `truncate` function instead of `set_len` which ensures that this will not happen. The result of calling the function will be nonsensical, but that's fine given the contract violation of the `Read` impl.~~ ~~The `Guard` type is also used by `append_to_string` which does not pass untrusted values into the length field, so I've copied the guard type into each function and only modified the one used by `read_to_end`. We could just keep a single one and modify it, but it seems a bit cleaner to keep the guard code close to the functions and related specifically to them.~~ To fix this, we now assert that the returned length is not larger than the buffer passed to the method. For reference, this bug has been present for ~2.5 years since 1.20: `ecbb896b9e`. Closes #80894.	2021-01-14 18:00:11 +00:00
Mara Bos	9fc298ca89	Rollup merge of #80217 - camelid:io-read_to_string, r=m-ou-se Add a `std::io::read_to_string` function I recognize that you're usually supposed to open an issue first, but the implementation is very small so it's okay if this is closed and it was 'wasted work' :) ----- The equivalent of `std::fs::read_to_string`, but generalized to all `Read` impls. As the documentation on `std::io::read_to_string` says, the advantage of this function is that it means you don't have to create a variable first and it provides more type safety since you can only get the buffer out if there were no errors. If you use `Read::read_to_string`, you have to remember to check whether the read succeeded because otherwise your buffer will be empty. It's friendlier to newcomers and better in most cases to use an explicit return value instead of an out parameter.	2021-01-14 18:00:00 +00:00
Camelid	7463292015	Add docs on performance	2021-01-11 19:18:39 -08:00
Steven Fackler	e6c07b0628	clarify docs a bit	2021-01-11 17:16:44 -05:00
Steven Fackler	5cb830397e	make check a bit more clear	2021-01-11 17:13:50 -05:00
Steven Fackler	a9ef7983a6	clean up control flow	2021-01-11 07:48:24 -05:00
Steven Fackler	ebe402dc9e	Fix handling of malicious Readers in read_to_end	2021-01-11 07:27:03 -05:00
Camelid	25a4964191	Use heading for `std::prelude` and not `io::prelude` The heading style for `std::prelude` is to be consistent with the headings for `std` and `core`: `# The Rust Standard Library` and `# The Rust Core Library`, respectively.	2021-01-05 17:52:24 -08:00
Ian Jackson	dea6d6c909	BufWriter::into_raw_parts: Add tracking issue number Signed-off-by: Ian Jackson <ijackson@chiark.greenend.org.uk>	2021-01-04 15:35:28 +00:00
Camelid	0506789014	Remove many unnecessary manual link resolves from library Now that #76934 has merged, we can remove a lot of these! E.g, this is no longer necessary: [`Vec<T>`]: Vec	2020-12-31 11:54:32 -08:00
Camelid	588786a788	Add error docs	2020-12-30 11:44:03 -08:00
Camelid	4ee6d1bf54	Add description independent of `Read::read_to_string`	2020-12-30 11:35:17 -08:00
Camelid	1f9a8a1620	Add a `std::io::read_to_string` function The equivalent of `std::fs::read_to_string`, but generalized to all `Read` impls. As the documentation on `std::io::read_to_string` says, the advantage of this function is that it means you don't have to create a variable first and it provides more type safety since you can only get the buffer out if there were no errors. If you use `Read::read_to_string`, you have to remember to check whether the read succeeded because otherwise your buffer will be empty. It's friendlier to newcomers and better in most cases to use an explicit return value instead of an out parameter.	2020-12-19 21:46:40 -08:00
Camelid	4a6014bc28	Use heading style for 'The I/O Prelude' in `std::io::prelude`	2020-12-18 15:05:15 -08:00
Ian Jackson	79c72f57d5	fixup! WriterPanicked: Use debug_struct	2020-12-12 18:39:30 +00:00
Ian Jackson	5ac431fb08	WriterPanicked: Use debug_struct Co-authored-by: Ivan Tham <pickfire@riseup.net>	2020-12-12 13:37:29 +00:00
Ian Jackson	7fab9cb8ac	bufwriter::WriterPanicked: Provide panicking example Signed-off-by: Ian Jackson <ijackson@chiark.greenend.org.uk>	2020-12-12 12:34:48 +00:00
bors	8cef65fde3	Auto merge of #77801 - fusion-engineering-forks:pin-mutex, r=Mark-Simulacrum Enforce no-move rule of ReentrantMutex using Pin and fix UB in stdio A `sys_common::ReentrantMutex` may not be moved after initializing it with `.init()`. This was not enforced, but only stated as a requirement in the comments on the unsafe functions. This change enforces this no-moving rule using `Pin`, by changing `&self` to a `Pin` in the `init()` and `lock()` functions. This uncovered a bug I introduced in #77154: stdio.rs (the only user of ReentrantMutex) called `init()` on its ReentrantMutexes while constructing them in the intializer of `SyncOnceCell::get_or_init`, which would move them afterwards. Interestingly, the ReentrantMutex unit tests already had the same bug, so this invalid usage has been tested on all (CI-tested) platforms for a long time. Apparently this doesn't break badly on any of the major platforms, but it does break the rules.\* To be able to keep using SyncOnceCell, this adds a `SyncOnceCell::get_or_init_pin` function, which makes it possible to work with pinned values inside a (pinned) SyncOnceCell. Whether this function should be public or not and what its exact behaviour and interface should be if it would be public is something I'd like to leave for a separate issue or PR. In this PR, this function is internal-only and marked with `pub(crate)`. \* Note: That bug is now included in 1.48, while this patch can only make it to ~~1.49~~ 1.50. We should consider the implications of 1.48 shipping with a wrong usage of `pthread_mutex_t` / `CRITICAL_SECTION` / .. which technically invokes UB according to their specification. The risk is very low, considering the objects are not 'used' (locked) before the move, and the ReentrantMutex unit tests have verified this works fine in practice. Edit: This has been backported and included in 1.48. And soon 1.49 too. --- In future changes, I want to push this usage of Pin further inside `sys` instead of only `sys_common`, and apply it to all 'unmovable' objects there (`Mutex`, `Condvar`, `RwLock`). Also, while `sys_common`'s mutexes and condvars are already taken care of by #77147 and #77648, its `RwLock` should still be made movable or get pinned.	2020-12-10 23:43:20 +00:00
bors	2c56ea38b0	Auto merge of #78768 - mzabaluev:optimize-buf-writer, r=cramertj Use is_write_vectored to optimize the write_vectored implementation for BufWriter In case when the underlying writer does not have an efficient implementation `write_vectored`, the present implementation of `write_vectored` for `BufWriter` may still forward vectored writes directly to the writer depending on the total length of the data. This misses the advantage of buffering, as the actually written slice may be small. Provide an alternative code path for the non-vectored case, where the slices passed to `BufWriter` are coalesced in the buffer before being flushed to the underlying writer with plain `write` calls. The buffer is only bypassed if an individual slice's length is at least as large as the buffer. Remove a FIXME comment referring to #72919 as the issue has been closed with an explanation provided.	2020-12-09 01:54:08 +00:00
Mara Bos	67c18fdec5	Use Pin for the 'don't move' requirement of ReentrantMutex. The code in io::stdio before this change misused the ReentrantMutexes, by calling init() on them and moving them afterwards. Now that ReentrantMutex requires Pin for init(), this mistake is no longer easy to make.	2020-12-08 22:57:57 +01:00
Mara Bos	2bc5d44ca9	Fix outdated comment about not needing to flush stderr.	2020-12-08 22:57:49 +01:00
Ian Jackson	b777552167	IntoInnerError: Provide into_error Signed-off-by: Ian Jackson <ijackson@chiark.greenend.org.uk>	2020-12-04 18:43:02 +00:00
Ian Jackson	19c7619dcd	IntoInnerError: Provide into_parts In particular, IntoIneerError only currently provides .error() which returns a reference, not an owned value. This is not helpful and means that a caller of BufWriter::into_inner cannot acquire an owned io::Error which seems quite wrong. Signed-off-by: Ian Jackson <ijackson@chiark.greenend.org.uk>	2020-12-04 18:43:02 +00:00
Ian Jackson	db5d697004	std: impl of `Write` for `&mut [u8]`: document the buffer full error Signed-off-by: Ian Jackson <ijackson@chiark.greenend.org.uk>	2020-12-04 18:38:44 +00:00
Ian Jackson	381763185e	BufWriter: Provide into_raw_parts If something goes wrong, one might want to unpeel the layers of nested Writers to perform recovery actions on the underlying writer, or reuse its resources. `into_inner` can be used for this when the inner writer is still working. But when the inner writer is broken, and returning errors, `into_inner` simply gives you the error from flush, and the same `Bufwriter` back again. Here I provide the necessary function, which I have chosen to call `into_raw_parts`. I had to do something with `panicked`. Returning it to the caller as a boolean seemed rather bare. Throwing the buffered data away in this situation also seems unfriendly: maybe the programmer knows something about the underlying writer and can recover somehow. So I went for a custom Error. This may be overkill, but it does have the nice property that a caller who actually wants to look at the buffered data, rather than simply extracting the inner writer, will be told by the type system if they forget to handle the panicked case. If a caller doesn't need the buffer, it can just be discarded. That WriterPanicked is a newtype around Vec<u8> means that hopefully the layouts of the Ok and Err variants can be very similar, with just a boolean discriminant. So this custom error type should compile down to nearly no code. Signed-off-by: Ian Jackson <ijackson@chiark.greenend.org.uk>	2020-12-04 18:28:02 +00:00
Mikhail Zabaluev	674dd623ee	Reduce branching in write_vectored for BufWriter Do what write does and optimize for the most likely case: slices are much smaller than the buffer. If a slice does not fit completely in the remaining capacity of the buffer, it is left out rather than buffered partially. Special treatment is only left for oversized slices that are written directly to the underlying writer.	2020-11-22 17:05:14 +02:00
Mikhail Zabaluev	00deeb35c8	Fix is_write_vectored in LineWriterShim Now that BufWriter always claims to support vectored writes, look through it at the wrapped writer to decide whether to use vectored writes for LineWriter.	2020-11-22 17:05:14 +02:00
Mikhail Zabaluev	9fc44239ec	Make is_write_vectored return true for BufWriter BufWriter provides an efficient implementation of write_vectored also when the underlying writer does not support vectored writes.	2020-11-22 17:05:13 +02:00
Mikhail Zabaluev	53196a8bcf	Optimize write_vectored for BufWriter If the underlying writer does not support efficient vectored output, do it differently: always try to coalesce the slices in the buffer until one comes that does not fit entirely. Flush the buffer before the first slice if needed.	2020-11-22 17:05:13 +02:00
William Chargin	bdaa76cfde	Fix typo in `std::io::Write` docs These referred to a “`Write`er”—extra e. Presumably a copy-paste holdover from “`Read`er”. Test Plan: Running ``git grep '`\?[Ww]rite`\?er'`` no longer finds any results. wchargin-branch: io-write-docs	2020-11-17 15:32:23 -08:00
Mara Bos	11ce918c75	Rollup merge of #78714 - m-ou-se:simplify-local-streams, r=KodrAus Simplify output capturing This is a sequence of incremental improvements to the unstable/internal `set_panic` and `set_print` mechanism used by the `test` crate: 1. Remove the `LocalOutput` trait and use `Arc<Mutex<dyn Write>>` instead of `Box<dyn LocalOutput>`. In practice, all implementations of `LocalOutput` were just `Arc<Mutex<..>>`. This simplifies some logic and removes all custom `Sink` implementations such as `library/test/src/helpers/sink.rs`. Also removes a layer of indirection, as the outermost `Box` is now gone. It also means that locking now happens per `write_fmt`, not per individual `write` within. (So `"{} {}\n"` now results in one `lock()`, not four or more.) 2. Since in all cases the `dyn Write`s were just `Vec<u8>`s, replace the type with `Arc<Mutex<Vec<u8>>>`. This simplifies things more, as error handling and flushing can be removed now. This also removes the hack needed in the default panic handler to make this work with `::realstd`, as (unlike `Write`) `Vec<u8>` is from `alloc`, not `std`. 3. Replace the `RefCell`s by regular `Cell`s. The `RefCell`s were mostly used as `mem::replace(&mut *cell.borrow_mut(), something)`, which is just `Cell::replace`. This removes an unecessary bookkeeping and makes the code a bit easier to read. 4. Merge `set_panic` and `set_print` into a single `set_output_capture`. Neither the test crate nor rustc (the only users of this feature) have a use for using these separately. Merging them simplifies things even more. This uses a new function name and feature name, to make it clearer this is internal and not supposed to be used by other crates. Might be easier to review per commit.	2020-11-16 17:26:27 +01:00
bors	30e49a9ead	Auto merge of #75272 - the8472:spec-copy, r=KodrAus specialize io::copy to use copy_file_range, splice or sendfile Fixes #74426. Also covers #60689 but only as an optimization instead of an official API. The specialization only covers std-owned structs so it should avoid the problems with #71091 Currently linux-only but it should be generalizable to other unix systems that have sendfile/sosplice and similar. There is a bit of optimization potential around the syscall count. Right now it may end up doing more syscalls than the naive copy loop when doing short (<8KiB) copies between file descriptors. The test case executes the following: ``` [pid 103776] statx(3, "", AT_STATX_SYNC_AS_STAT\|AT_EMPTY_PATH, STATX_ALL, {stx_mask=STATX_ALL\|STATX_MNT_ID, stx_attributes=0, stx_mode=S_IFREG\|0644, stx_size=17, ...}) = 0 [pid 103776] write(4, "wxyz", 4) = 4 [pid 103776] write(4, "iklmn", 5) = 5 [pid 103776] copy_file_range(3, NULL, 4, NULL, 5, 0) = 5 ``` 0-1 `stat` calls to identify the source file type. 0 if the type can be inferred from the struct from which the FD was extracted 𝖬 `write` to drain the `BufReader`/`BufWriter` wrappers. only happen when buffers are present. 𝖬 ≾ number of wrappers present. If there is a write buffer it may absorb the read buffer contents first so only result in a single write. Vectored writes would also be an option but that would require more invasive changes to `BufWriter`. 𝖭 `copy_file_range`/`splice`/`sendfile` until file size, EOF or the byte limit from `Take` is reached. This should generally be much more efficient than the read-write loop and also have other benefits such as DMA offload or extent sharing. ## Benchmarks ``` OLD test io::tests::bench_file_to_file_copy ... bench: 21,002 ns/iter (+/- 750) = 6240 MB/s [ext4] test io::tests::bench_file_to_file_copy ... bench: 35,704 ns/iter (+/- 1,108) = 3671 MB/s [btrfs] test io::tests::bench_file_to_socket_copy ... bench: 57,002 ns/iter (+/- 4,205) = 2299 MB/s test io::tests::bench_socket_pipe_socket_copy ... bench: 142,640 ns/iter (+/- 77,851) = 918 MB/s NEW test io::tests::bench_file_to_file_copy ... bench: 14,745 ns/iter (+/- 519) = 8889 MB/s [ext4] test io::tests::bench_file_to_file_copy ... bench: 6,128 ns/iter (+/- 227) = 21389 MB/s [btrfs] test io::tests::bench_file_to_socket_copy ... bench: 13,767 ns/iter (+/- 3,767) = 9520 MB/s test io::tests::bench_socket_pipe_socket_copy ... bench: 26,471 ns/iter (+/- 6,412) = 4951 MB/s ```	2020-11-14 12:01:55 +00:00
The8472	888b1031bc	limit visibility of copy offload helpers to sys::unix module	2020-11-13 22:38:27 +01:00
The8472	18bfe2a66b	move copy specialization tests to their own module	2020-11-13 22:38:27 +01:00
The8472	7f5d2722af	move copy specialization into sys::unix module	2020-11-13 22:38:23 +01:00
The8472	ad9b07c7e5	add benchmarks	2020-11-13 19:46:37 +01:00
The8472	46e7fbe60b	reduce syscalls by inferring FD types based on source struct instead of calling stat() also adds handling for edge-cases involving large sparse files where sendfile could fail with EOVERFLOW	2020-11-13 19:46:35 +01:00
The8472	0624730d9e	add forwarding specializations for &mut variants `impl Write for &mut T where T: Write`, thus the same should apply to the specialization traits	2020-11-13 19:45:38 +01:00
The8472	cd3bddc044	prioritize sendfile over splice since it results in fewer context switches when sending to pipes splice returns to userspace when the pipe is full, sendfile just blocks until it's done, this can achieve much higher throughput	2020-11-13 19:45:38 +01:00
The8472	67a6059aa5	move tests module into separate file	2020-11-13 19:45:38 +01:00
The8472	5eb88fa5c7	hide unused exports on other platforms	2020-11-13 19:45:38 +01:00
The8472	16236470c1	specialize io::copy to use copy_file_range, splice or sendfile Currently it only applies to linux systems. It can be extended to make use of similar syscalls on other unix systems.	2020-11-13 19:45:27 +01:00
Mara Bos	aff7bd66e8	Merge set_panic and set_print into set_output_capture. There were no use cases for setting them separately. Merging them simplifies some things.	2020-11-10 21:58:13 +01:00
Mara Bos	08b7cb79e0	Use Cell instead of RefCell for LOCAL_{STDOUT,STDERR}.	2020-11-10 21:58:13 +01:00
Mara Bos	f534b75f05	Use Vec<u8> for LOCAL_STD{OUT,ERR} instead of dyn Write. It was only ever used with Vec<u8> anyway. This simplifies some things. - It no longer needs to be flushed, because that's a no-op anyway for a Vec<u8>. - Writing to a Vec<u8> never fails. - No #[cfg(test)] code is needed anymore to use `realstd` instead of `std`, because Vec comes from alloc, not std (like Write).	2020-11-10 21:58:09 +01:00
Mara Bos	72e96604c0	Remove io::LocalOutput and use Arc<Mutex<dyn>> for local streams.	2020-11-10 21:57:05 +01:00
Mara Bos	77f333b304	Rollup merge of #78811 - a1phyr:const_io_structs, r=dtolnay Make some std::io functions `const` Tracking issue: #78812 Make the following functions `const`: - `io::Cursor::new` - `io::Cursor::get_ref` - `io::Cursor::position` - `io::empty` - `io::repeat` - `io::sink` r? `````@dtolnay`````	2020-11-08 13:36:19 +01:00
Benoît du Garreau	001dd7e6a5	Add tracking issue	2020-11-06 18:04:52 +01:00
Benoît du Garreau	ae059b532f	Make some std::io functions `const` Includes: - io::Cursor::new - io::Cursor::get_ref - io::Cursor::position - io::empty - io::repeat - io::sink	2020-11-06 17:48:26 +01:00
Peter Jaszkowiak	8d48e3bbb2	document HACKs	2020-11-05 19:26:08 -07:00
Peter Jaszkowiak	fe6dfcd28a	Intra-doc links for std::io::buffered	2020-11-05 19:09:42 -07:00
bors	56d288fa46	Auto merge of #78227 - SergioBenitez:test-stdout-threading, r=m-ou-se Capture output from threads spawned in tests This is revival of #75172. Original text: > Fixes #42474. > > r? `@dtolnay` since you expressed interest in this, but feel free to redirect if you aren't the right person anymore. --- Closes #75172.	2020-10-27 11:43:18 +00:00
Michele Lacchia	a4ba179bdd	fix(docs): typo in BufWriter documentation	2020-10-26 11:13:47 +01:00
Sergio Benitez	db15596c57	Only load LOCAL_STREAMS if they are being used	2020-10-22 18:15:48 -07:00
Tyler Mandry	d0d0e78208	Capture output from threads spawned in tests Fixes #42474.	2020-10-22 18:15:44 -07:00
Dylan DPC	5acb7f198f	Rollup merge of #76084 - Lucretiel:split-buffered, r=dtolnay Refactor io/buffered.rs into submodules This pull request splits `BufWriter`, `BufReader`, `LineWriter`, and `LineWriterShim` (along with their associated tests) into separate submodules. It contains no functional changes. This change is being made in anticipation of adding another type of buffered writer which can be switched between line- and block-buffering mode. Part of a series of pull requests resolving #60673.	2020-10-16 02:10:04 +02:00
Mara Bos	de597fca40	Optimize set_{panic,print}(None).	2020-09-27 16:04:25 +02:00
Mara Bos	ed3ead013f	Relax memory ordering of LOCAL_STREAMS and document it.	2020-09-27 16:04:25 +02:00
Mara Bos	07fd17f701	Only use LOCAL_{STDOUT,STDERR} when set_{print/panic} is used. The thread local LOCAL_STDOUT and LOCAL_STDERR are only used by the test crate to capture output from tests when running them in the same process in differen threads. However, every program will check these variables on every print, even outside of testing. This involves allocating a thread local key, and registering a thread local destructor. This can be somewhat expensive. This change keeps a global flag (LOCAL_STREAMS) which will be set to true when either of these local streams is used. (So, effectively only in test and benchmark runs.) When this flag is off, these thread locals are not even looked at and therefore will not be initialized on the first output on every thread, which also means no thread local destructors will be registered.	2020-09-27 16:04:25 +02:00
bors	c9e5e6a53a	Auto merge of #77154 - fusion-engineering-forks:lazy-stdio, r=dtolnay Remove std::io::lazy::Lazy in favour of SyncOnceCell The (internal) std::io::lazy::Lazy was used to lazily initialize the stdout and stdin buffers (and mutexes). It uses atexit() to register a destructor to flush the streams on exit, and mark the streams as 'closed'. Using the stream afterwards would result in a panic. Stdout uses a LineWriter which contains a BufWriter that will flush the buffer on drop. This one is important to be executed during shutdown, to make sure no buffered output is lost. It also forbids access to stdout afterwards, since the buffer is already flushed and gone. Stdin uses a BufReader, which does not implement Drop. It simply forgets any previously read data that was not read from the buffer yet. This means that in the case of stdin, the atexit() function's only effect is making stdin inaccessible to the program, such that later accesses result in a panic. This is uncessary, as it'd have been safe to access stdin during shutdown of the program. --- This change removes the entire io::lazy module in favour of SyncOnceCell. SyncOnceCell's fast path is much faster (a single atomic operation) than locking a sys_common::Mutex on every access like Lazy did. However, SyncOnceCell does not use atexit() to drop the contained object during shutdown. As noted above, this is not a problem for stdin. It simply means stdin is now usable during shutdown. The atexit() call for stdout is moved to the stdio module. Unlike the now-removed Lazy struct, SyncOnceCell does not have a 'gone and unusable' state that panics. Instead of adding this again, this simply replaces the buffer with one with zero capacity. This effectively flushes the old buffer and makes any writes afterwards pass through directly without touching a buffer, making print!() available during shutdown without panicking. --- In addition, because the contents of the SyncOnceCell are no longer dropped, we can now use `&'static` instead of `Arc` in `Stdout` and `Stdin`. This also saves two levels of indirection in `stdin()` and `stdout()`, since Lazy effectively stored a `Box<Arc<T>>`, and SyncOnceCell stores the `T` directly.	2020-09-27 04:50:46 +00:00
Mara Bos	6f9c1323a7	Call ReentrantMutex::init() in stdout().	2020-09-24 19:25:21 +02:00
Mara Bos	45700a9d58	Drop use of Arc from Stdin and Stdout.	2020-09-24 19:09:33 +02:00
Mara Bos	bab15f773a	Remove std::io::lazy::Lazy in favour of SyncOnceCell The (internal) std::io::lazy::Lazy was used to lazily initialize the stdout and stdin buffers (and mutexes). It uses atexit() to register a destructor to flush the streams on exit, and mark the streams as 'closed'. Using the stream afterwards would result in a panic. Stdout uses a LineWriter which contains a BufWriter that will flush the buffer on drop. This one is important to be executed during shutdown, to make sure no buffered output is lost. It also forbids access to stdout afterwards, since the buffer is already flushed and gone. Stdin uses a BufReader, which does not implement Drop. It simply forgets any previously read data that was not read from the buffer yet. This means that in the case of stdin, the atexit() function's only effect is making stdin inaccessible to the program, such that later accesses result in a panic. This is uncessary, as it'd have been safe to access stdin during shutdown of the program. --- This change removes the entire io::lazy module in favour of SyncOnceCell. SyncOnceCell's fast path is much faster (a single atomic operation) than locking a sys_common::Mutex on every access like Lazy did. However, SyncOnceCell does not use atexit() to drop the contained object during shutdown. As noted above, this is not a problem for stdin. It simply means stdin is now usable during shutdown. The atexit() call for stdout is moved to the stdio module. Unlike the now-removed Lazy struct, SyncOnceCell does not have a 'gone and unusable' state that panics. Instead of adding this again, this simply replaces the buffer with one with zero capacity. This effectively flushes the old buffer and makes any writes afterwards pass through directly without touching a buffer, making print!() available during shutdown without panicking.	2020-09-24 18:18:48 +02:00
ecstatic-morse	65bdf79da3	Rollup merge of #76275 - FedericoPonzi:immutable-write-impl-73836, r=dtolnay Implementation of Write for some immutable ref structs Fixes #73836	2020-09-21 20:40:44 -07:00
Federico Ponzi	88a29e630c	Updates stability attributes to the current nightly version	2020-09-21 08:52:59 +02:00
Federico Ponzi	ec7f9b927f	Deduplicates io::Write implementations	2020-09-11 11:39:31 +02:00
Nathan West	96229f0240	move buffered.rs to mod.rs	2020-09-10 23:48:22 -04:00
Nathan West	a020142805	Refactor io/buffered.rs into submodules	2020-09-10 23:39:55 -04:00
bors	9fe551ae49	Auto merge of #74366 - t-rapp:tr-bufreader-pos, r=LukasKalbertodt Implement Seek::stream_position() for BufReader Optimization over `BufReader::seek()` for getting the current position without flushing the internal buffer. Related to #31100. Based on the code in #70577.	2020-09-07 11:09:41 +00:00
Tobias Rapp	246d3271fe	Implement Seek::stream_position() for BufReader Optimization over BufReader::seek() for getting the current position without flushing the internal buffer. Related to #31100. Based on code in #70577.	2020-09-07 09:26:48 +02:00
Federico Ponzi	28db5214d2	More implementations of Write for immutable refs Fixes #73836	2020-09-03 09:36:05 +02:00
Ralf Jung	0af3bd01df	Read: adjust a FIXME reference	2020-09-02 12:34:15 +02:00
bors	d9cd4a33f5	Auto merge of #76047 - Dylan-DPC:rename/maybe, r=RalfJung rename get_{ref, mut} to assume_init_{ref,mut} in Maybeuninit References #63568 Rework with comments addressed from #66174 Have replaced most of the occurrences I've found, hopefully didn't miss out anything r? @RalfJung (thanks @danielhenrymantilla for the initial work on this)	2020-09-01 05:41:22 +00:00
Lzu Tao	a4e926daee	std: move "mod tests/benches" to separate files Also doing fmt inplace as requested.	2020-08-31 02:56:59 +00:00
DPC	b3d7b7bdcb	update fixmes	2020-08-30 14:43:52 +02:00
DPC	5e208efaa8	rename get_{ref, mut} to assume_init_{ref,mut} in Maybeuninit	2020-08-29 02:13:02 +02:00
bors	7b1dd61bda	Auto merge of #72808 - Lucretiel:line-writer-reimpl, r=Amanieu Substantial refactor to the design of LineWriter # Preamble This is the first in a series of pull requests designed to move forward with https://github.com/rust-lang/rust/issues/60673 (and the related [5 year old FIXME](`ea7181b5f7/src/libstd/io/stdio.rs (L459-L461)`)), which calls for an update to `Stdout` such that it can be block-buffered rather than line-buffered under certain circumstances (such as a `tty`, or a user setting the mode with a function call). This pull request refactors the logic `LineWriter` into a `LineWriterShim`, which operates on a `BufWriter` by mutable reference, such that it is easy to invoke the line-writing logic on an existing `BufWriter` without having to construct a new `LineWriter`. Additionally, fixes #72721 ## A note on flushing Because the word flush tends to be pretty overloaded in this discussion, I'm going to use the word unbuffered to refer to a `BufWriter` sending its data to the wrapped writer via `write`, without calling `flush` on it, and I'll be using flushed when referring to sending data via flush, which recursively writes the data all the way to the final sink. For example, given a `T = BufWriter<BufWriter<File>>`, saying that `T` unbuffers its data means that it is sent to the inner `BufWriter`, but not necessarily to the `File`, whereas saying that `T` flushes its data means that causes it (via `Write::flush`) to be delivered all the way to `File`. # Goals Once it became clear (for reasons described below) that the best way to approach this would involve refactoring `LineWriter` to work more directly on `BufWriter`'s internals, I established the following design goals for the refactor: - Do not duplicate logic with `BufWriter`. It's great at buffering and then unbuffering data, so use the existing logic as much as possible. - Minimize superfluous copying of data into `BufWriter`'s buffer. - Eliminate calls to `BufWriter::flush` and instead do the same thing as `BufWriter::write`, which is to only write to the wrapped writer (rather than flushing all the way down to the final data sink). - Uphold the "at-most 1 write of new data" convention of `Write::write` - Minimize or eliminate dropping errors (that is, eliminate the parts of the old design that threw away errors because `write` must report if any bytes were written) - As much as possible, attempt to fully flush completed lines, and not flush partial lines. One of the advantages of this design is that, so long as we don't encounter lines larger than the `BufWriter`'s capacity, partial lines will never be unbuffered, while completed lines will always be unbuffered (with subsequent calls to `LineWriter::write` retrying failed writes before processing new data. # Design There are two major & related parts of the design. First, a new internal stuct, `LineWriterShim`, is added. This struct implements all of the actual logic of line-writing in a `Write` implementation, but it only operates on an `&mut BufWriter`. This means that this shim can be constructed on-the-fly to apply line writing logic to an existing `BufWriter`. This is in fact how `LineWriter` has been updated to operate, and it is also how `Stdout` is being updated in my [development branch](https://github.com/Lucretiel/rust/tree/stdout-block-buffer) to switch which mode it wants to use at runtime. [An example of how this looks in practice](`f24f272df6/src/libstd/io/stdio.rs (L479-L484)` ) The second major part of the design that the line-buffering logic, implemented in `LineWriterShim`, has been updated to work slightly more directly on the internals of `BufWriter`. Mostly it makes us of the public interface—particularly `buffer()` and `get_mut()`—but it also controls the flushing of the buffer with `flush_buf` rather than `flush`, and it writes to the buffer infallibly with a new `write_to_buffer` method. This has several advantages: - Data no longer has to round trip through the `BufWriter`'s buffer. If the user provides a complete line, that line is written directly to the inner writer (after ensuring the existing buffer is flushed). - The conventional contract of `write`—that at-most 1 attempt to write new data is made—is much more cleanly upheld, because we don't have to perform fallible flushes and perform semi-complicated logic of trying to pretend errors at different stages didn't happen. Instead, after attempting to write lines directly to the buffer, we can infallibly add trailing data to the buffer without allowing any attempts to continue writing it to the `inner` writer. - Perhaps most importantly, `LineWriter` no longer performs a full flush on every line. This makes its behavior much more consistent with `BufWriter`, which unbuffers data to its inner writer, without trying to flush it all the way to the final device. Previously, `LineWriter` had no choice but to use `flush` to ensure that the lines were unbuffered, but by writing directly to `inner` via `get_mut()` (when appropriate), we can use a more correct behavior. ## New(ish) line buffering logic The logic for line writing has been cleaned up, as described above. It now follows this algorithm for `write`, with minor adjustments for `write_all` and `write_vectored`: - Does our input data contain a newline? - If no: - simply use the regular `BufWriter::write` to write it; this will append it to the buffer and/or flush it as necessary based on how full the buffer is and how much input data there is. - additionally, if the current buffer ends with `'\n'`, attempt to immediately flush it with `flush_buf` before calling `BufWriter::write` This reproduces the old `needs_flush` behavior and ensures completed lines are flushed as soon as possible. The reason we only check if the buffer ends with `'\n'` is discussed later. - If yes: - First, `flush_buf` - Then use `bufwriter.get_mut().write()` to write the input data directly to the underlying writer, up to the last newline. Make at most one attempt at this. - If it errors, return the error - If it succeeds with a full write, add the remaining data (between the last newline and the end of the input) to the buffer. In order to uphold the "at-most 1 attempt to write new data" convention, no attempts are made to write this data to the inner writer (though obviously a subsequent write may immediately flush it, e.g., if it totally filled the buffer's capacity. - If it only partially succeeds, buffer the data only up to the last newline. We do this to try to avoid writing partial lines to the inner writer where possible (that is, whenever the lines are shorter than the total buffer capacity). While it was not my intention for this behavior to diverge from this existing `LineWriter` algorithm, this updated design emerged very naturally once `LineWriter` wasn't burdened with having to only operate via `BufWriter::flush`. There essentially two main changes to observable behavior: - `flush` is no longer used to unbuffer lines. The are only written to the writer wrapped by `LineWriter`; this inner writer might do its own buffering. This change makes `LineWriter` consistent with the behavior of `BufWriter`. This is probably the most obvious user-visible change; it's the one I most expect to provoke issue reports, if any are provoked. - Unless a line exceeds the capacity of the buffer, partial lines are not unbuffered (without the user manually calling flush). This is a less surprising behavior, and is enabled because `LineWriter` now has more precise control of what data is buffered and when it is unbuffered. I'd be surprised if anyone is relying on `LineWriter` unbuffering or flushing partial lines that are shorter than the capacity, so I'm not worried about this one. None of these changes are inconsistent with any published documentation of `LineWriter`. Nonetheless, like all changes with user-facing behavior changes, this design will obviously have to be very carefully scrutinized. # Alternative designs and design rationalle The initial goal of this project was to provide a way for the `LineWriter` logic to be operable directly on a `BufWriter`, so that the updated `Stdout` doesn't need to do something convoluted like `enum { BufWriter, LineWriter }` (which ends up being ~~impossible~~ difficult to transition between states after being constructed). The design went through several iterations before arriving at the current draft. The major first version simply involved adding methods like `write_line_buffered` to `BufWriter`; these would contain the actual logic of line-buffered writing, and would additionally have the advantages (described above) of operating directly on the internals of `BufWriter`. The idea was that `LineWriter` would simply call these methods, and the updated `Stdout` would use either `BufWriter::write` or `BufWriter::write_line_buffered`, depending on what mode it was in. The major issue with this design is that it loses the ability to take advantage of the `io::Write` trait, which provides several useful default implementations of the various io methods, such as `write_fmt` and `write_all`, just using the core methods. For this reason, the `write_line_buffered` design was retained, but moved into a separate struct called `LineWriterShim` which operates on an `&mut LineWriter`. As part of this move, the logic was lightly retooled to not touch the innards of `BufWriter` directly, but instead to make use of the unexported helper methods like `flush_buf`. The other design evolutions were mostly related to answering questions like "how much data should be buffered", "how should partial line writes be handled", etc. As much as possible I tried to answer these by emulating the current `LineWriter` logic (which, for example, retries partial line writes on subsequent calls to `write`) while still meeting the refactor design goals. # Next steps ~Currently, this design fails a few `LineWriter` tests, mostly because they expect `LineWriter` to fully flush its content. There are also some changes to the way that `LineWriter` buffers data after writing completed lines, aimed at ensuring that partial lines are not unbuffered prematurely. I want to make sure I fully understand the intent behind these tests before I either update the test or update this design so that they pass.~ However, in the meantime I wanted to get this published so that feedback could start to accumulate on it. There's a lot of errata around how I arrived at this design that didn't really fit in this overlong document, so please ask questions about anything that confusing or unclear and hopefully I can explain more of the rationale that led to it. # Test updates This design required some tests to be updated; I've research the intent behind these tests (mostly via `git blame`) and updated them appropriately. Those changes are cataloged here. - `test_line_buffer_fail_flush`: This test was added as a regression test for #32085, and is intended to assure that an errors from `flush` aren't propagated when preceded by a successful `write`. Because type of issue is no longer possible, because `write` calls `buffer.get_mut().write()` instead of `buffer.write(); buffer.flush();`, I'm simply removing this test entirely. Other, similar error invariants related to errors during write-retrying are handled in other test cases. - `erroneous_flush_retried`: This test was added as a regression test for #37807, and was intended to ensure that flush-retrying (via `needs_flush`) and error-ignoring were being handled correctly (ironically, this issue was caused by the flush-error-ignoring, above). Half of that issue is not possible by design with this refactor, because we no longer make fallible i/o calls that might produce errors we have to ignore after unbuffering lines. The `should_flush` behavior is captured by checking for a trailing newline in the `LineWriter` buffer; this test now checks that behavior. - `line_vectored`: changes here were pretty minor, mostly related to when partial lines are or aren't written. The old implementation of `write_vectored` used very complicated logic to precisely determine the location of the last newline and precisely write up to that point; this required doing several consecutive fallible writes, with all the complex error handling or ignoring issues that come with it. The updated design does at-most one write of a subset of total buffers (that is, it doesn't split in the middle of a buffer), even if that means writing partial lines. One of the major advantages of the new design is that the underlying vectored write operation on the device can be taken advantage of, even with small writes, so long as they include a newline; previously these were unconditionally buffered then written. - `line_vectored_partial_and_errors`: Pretty similiar to `line_vectored`, above; this test is for basic error recovery in `write_vectored` for vectored writes. As previously discussed, the mocked behavior being tested for (errors ignored under certain circumstances) no occurs, so I've simplified the test while doing my best to retain its spirit.	2020-08-28 23:41:57 +00:00
Nathan West	c91e764d51	Once again, x.py tidy	2020-08-27 22:55:58 -04:00
Nathan West	d2d8bcb50e	Typo fixes	2020-08-27 22:49:16 -04:00
Nathan West	017ed5a579	Improvements to `LineWriter::write_all` `LineWriter::write_all` now only emits a single write when writing a newline when there's already buffered data.	2020-08-27 22:32:28 -04:00
Joshua Nelson	6f4681bacc	Convert str -> prim@str in `std`	2020-08-23 22:40:20 -04:00
Tomasz Miąsko	78e094632e	Remove wrapper type handling absent raw standard streams Raw standard streams are always available. Remove unused wrapper type that was supposed to be responsible for handling their absence.	2020-08-21 13:17:20 +02:00
Tomasz Miąsko	4a00421ba4	Make raw standard stream constructors const	2020-08-21 13:17:20 +02:00
Tomasz Miąsko	479c23bb49	Remove result type from raw standard streams constructors Raw standard streams constructors are infallible. Remove unnecessary result type.	2020-08-21 13:17:20 +02:00
Alexis Bourget	dad8e11e9f	Fix nits in intra-doc links for std io	2020-08-19 16:26:17 +02:00
Alexis Bourget	5d49c0e55a	Move to intra doc links for std::io	2020-08-18 19:36:52 +02:00
Camelid	a7749fe451	Fix intra-doc link	2020-08-12 15:30:15 -07:00
Camelid	bc8367617e	Switch to intra-doc links in `std/io/mod.rs`	2020-08-12 15:11:17 -07:00
Nathan West	3aa233d3dc	Rebase the LineWriter refactor to the new stdlib layout	2020-08-12 15:04:53 -04:00
mark	2c31b45ae8	mv std libs to library/	2020-07-27 19:51:13 -05:00

... 2 3 4 5 6

294 Commits