The module file provides an interface to the file system.
On operating systems with thread support (Solaris and Windows),
it is possible to let file operations be performed in threads of
their own, allowing other Erlang processes to continue executing
in parallel with the file operations. See the command line flag
+A in erl(1).
iodata() = iolist() | binary()
iolist() = [char() | binary() | iolist()]
io_device()
as returned by file:open/2, a process handling IO protocols
name() = string() | atom() | DeepList
DeepList = [char() | atom() | DeepList]
posix()
an atom which is named from the Posix error codes used in
Unix, and in the runtime libraries of most C compilers
time() = {{Year, Month, Day}, {Hour, Minute, Second}}
Year = Month = Day = Hour = Minute = Second = int()
Must denote a valid date and time
change_group(Filename, Gid) -> ok | {error, Reason}
Types:
Filename = name()
Gid = int()
Reason = posix()
Changes group of a file. See write_file_info/2.
change_owner(Filename, Uid) -> ok | {error, Reason}
Types:
Filename = name()
Uid = int()
Reason = posix()
Changes owner of a file. See write_file_info/2.
change_owner(Filename, Uid, Gid) -> ok | {error, Reason}
Types:
Filename = name()
Uid = int()
Gid = int()
Reason = posix()
Changes owner and group of a file. See write_file_info/2.
change_time(Filename, Mtime) -> ok | {error, Reason}
Types:
Filename = name()
Mtime = time()
Reason = posix()
Changes the modification and access times of a file. See write_file_info/2.
change_time(Filename, Mtime, Atime) -> ok | {error, Reason}
Types:
Filename = name()
Mtime = Atime = time()
Reason = posix()
Changes the modification and last access times of a file. See write_file_info/2.
close(IoDevice) -> ok | {error, Reason}
Types:
IoDevice = io_device()
Reason = posix()
Closes the file referenced by IoDevice. It mostly
returns ok, expect for some severe errors such as out
of memory.
Note that if the option delayed_write was
used when opening the file, close/1 might return an
old write error and not even try to close the file. See
open/2.
consult(Filename) -> {ok, Terms} | {error, Reason}
Types:
Filename = name()
Terms = [term()]
Reason = posix() | {Line, Mod, Term}
Line, Mod, Term -- see below
Reads Erlang terms, separated by '.', from Filename.
Returns one of the following:
{ok, Terms}
{error, Posix}
{error, {Line, Mod, Term}}
format_error/1 to convert
the three-element tuple to an English description of
the error.Example:
f.txt: {person, "kalle", 25}.
{person, "pelle", 30}.
1> file:consult("f.txt").
{ok,[{person,"kalle",25},{person,"pelle",30}]}
copy(Source, Destination) ->
copy(Source, Destination, ByteCount) -> {ok, BytesCopied}
| {error, Reason}
Types:
Source = Destination = io_device() | Filename
| {Filename, Modes}
Filename = name()
Modes = [Mode] -- see open/2
ByteCount = int() >= 0 | infinity
BytesCopied = int()
Copies ByteCount bytes from Source to
Destination. Source and Destination refer
to either filenames or IO devices from e.g. open/2.
ByteCount defaults infinity, denoting an
infinite number of bytes.
The argument Modes is a list of possible modes, see
open/2, and defaults to
[].
If both Source and Destination refer to
filenames, the files are opened with [read, binary]
and [write, binary] prepended to their mode lists,
respectively, to optimize the copy.
If Source refers to a filename, it is opened with
read mode prepended to the mode list before the copy,
and closed when done.
If Destination refers to a filename, it is opened
with write mode prepended to the mode list before
the copy, and closed when done.
Returns {ok, BytesCopied} where BytesCopied is
the number of bytes that actually was copied, which may be
less than ByteCount if end of file was encountered on
the source. If the operation fails, {error, Reason} is
returned.
Typical error reasons: As for open/2 if a file had to
be opened, and as for read/2 and write/2.
del_dir(Dir) -> ok | {error, Reason}
Types:
Dir = name()
Reason = posix()
Tries to delete the directory Dir. The directory must
be empty before it can be deleted. Returns ok if
successful.
Typical error reasons are:
eacces
Dir.eexist
enoent
enotdir
Dir is not a directory. On some
platforms, enoent is returned instead.einval
eacces is returned instead.delete(Filename) -> ok | {error, Reason}
Types:
Filename = name()
Reason = posix()
Tries to delete the file Filename. Returns ok
if successful.
Typical error reasons are:
enoent
eacces
eperm
enotdir
enoent is returned instead.eval(Filename) -> ok | {error, Reason}
Types:
Filename = name()
Reason = posix() | {Line, Mod, Term}
Line, Mod, Term -- see below
Reads and evaluates Erlang expressions, separated by '.' (or
',', a sequence of expressions is also an expression), from
Filename. The actual result of the evaluation is not
returned; any expression sequence in the file must be there
for its side effect. Returns one of the following:
ok
{error, Posix}
open/2 for a list of typical error codes.{error, {Line, Mod, Term}}
format_error/1 to
convert the three-element tuple to an English description
of the error.eval(Filename, Bindings) -> ok | {error, Reason}
Types:
Filename = name()
Bindings -- see erl_eval(3)
Reason = posix() | {Line, Mod, Term}
Line, Mod, Term -- see eval/1
The same as eval/1 but the variable bindings
Bindings are used in the evaluation. See
erl_eval(3) about
variable bindings.
file_info(Filename) -> {ok, FileInfo} | {error, Reason}
This function is obsolete. Use read_file_info/1
instead.
Types:
Reason = posix() | {Line, Mod, Term}
Line, Mod, Term -- see eval/1
Chars = [char() | Chars]
Given the error reason returned by any function in this module, returns a descriptive string of the error in English.
get_cwd() -> {ok, Dir} | {error, Reason}
Types:
Dir = string()
Reason = posix()
Returns {ok, Dir}, where Dir is the current
working directory of the file server.
![]() |
In rare circumstances, this function can fail on Unix. It may happen if read permission does not exist for the parent directories of the current directory. |
Typical error reasons are:
eacces
get_cwd(Drive) -> {ok, Dir} | {error, Reason}
Types:
Drive = string() -- see below
Dir = string()
Reason = posix()
Drive should be of the form "Letter:",
for example "c:". Returns {ok, Dir} or
{error, Reason}, where Dir is the current
working directory of the drive specified.
This function returns {error, enotsup} on platforms
which have no concept of current drive (Unix, for example).
Typical error reasons are:
enotsup
eacces
einval
Drive is invalid.list_dir(Dir) -> {ok, Filenames} | {error, Reason}
Types:
Dir = name()
Filenames = [Filename]
Filename = string()
Reason = posix()
Lists all the files in a directory. Returns
{ok, Filenames} if successful. Otherwise, it returns
{error, Reason}. Filenames is a list of
the names of all the files in the directory. The names are
not sorted.
Typical error reasons are:
eacces
Dir or
one of its parent directories.enoent
make_dir(Dir) -> ok | {error, Reason}
Types:
Dir = name()
Reason = posix()
Tries to create the directory Dir. Missing parent
directories are not created. Returns ok if
successful.
Typical error reasons are:
eacces
Dir.eexist
Dir.
enoent
Dir does not exist.enospc
enotdir
Dir is not a directory. On some
platforms, enoent is returned instead.make_link(Existing, New) -> ok | {error, Reason}
Types:
Existing = New = name()
Reason = posix()
Makes a hard link from Existing to New, on
platforms that support links (Unix). This function returns
ok if the link was successfully created, or
{error, Reason}. On platforms that do not support
links, {error,enotsup} is returned.
Typical error reasons:
eacces
Existing or New.eexist
New already exists.enotsup
make_symlink(Name1, Name2) -> ok | {error, Reason}
Types:
Name1 = Name2 = name()
Reason = posix()
This function creates a symbolic link Name2 to
the file or directory Name1, on platforms that support
symbolic links (most Unix systems). Name1 need not
exist. This function returns ok if the link was
successfully created, or {error, Reason}. On platforms
that do not support symbolic links, {error, enotsup}
is returned.
Typical error reasons:
eacces
Name1 or Name2.eexist
Name2 already exists.enotsup
open(Filename, Modes) -> {ok, IoDevice} | {error, Reason}
Types:
Filename = name()
Modes = [Mode]
Mode = read | write | append | raw | binary
| {delayed_write, Size, Delay} | delayed_write
| {read_ahead, Size} | read_ahead | compressed
Size = Delay = int()
IoDevice = io_device()
Reason = posix()
Opens the file Filename in the mode determined by
Modes, which may contain one or more of the following
items:
read
write
write is not
combined with read, the file will be truncated.append
append will take place at
the end of the file.raw
raw option allows faster access to a file,
because no Erlang process is needed to handle the file.
However, a file opened in this way has the following
limitations:io module cannot be used,
because they can only talk to an Erlang process.
Instead, use the read/2 and write/2
functions.
binary
raw option
is specified as well. When specified, read operations on
the file using the read/2 function will return
binaries rather than lists.{delayed_write, Size, Delay}
write/2 calls is buffered until there are at least
Size bytes buffered, or until the oldest buffered
data is Delay milliseconds old. Then all buffered
data is written in one operating system call.
The buffered data is also flushed before some other file
operation than write/2 is executed.write/2 calls should be for sizes significantly
less than Size, and not interspersed by to many
other file operations, for this to happen.write/2
calls may prematurely be reported as successful, and if
a write error should actually occur the error is
reported as the result of the next file operation, which
is not executed.delayed_write is used, after a
number of write/2 calls, close/1 might
return {error, enospc} because there was not enough
space on the disc for previously written data, and
close/1 should probably be called again since the
file is still open.delayed_write
{delayed_write, Size, Delay} with
reasonable default values for Size and
Delay. (Roughly some 64 KBytes, 2 seconds){read_ahead, Size}
read/2 calls are for significantly less than
Size bytes, read operations towards the operating
system are still performed for blocks of Size
bytes. The extra data is buffered and returned in
subsequent read/2 calls, giving a performance gain
since the number of operating system calls is reduced.read/2 calls are for sizes not significantly
less than, or even greater than Size bytes, no
performance gain can be expected.read_ahead
{read_ahead, Size} with a reasonable
default value for Size. (Roughly some 64 KBytes)
compressed
read_file_info/1 will most probably not match the
number of bytes that can be read from a compressed file.
Returns:
{ok, IoDevice}
IoDevice is a reference to the file.{error, Reason}
IoDevice is really the pid of the process which
handles the file. This process is linked to the process
which originally opened the file. If any process to which
the IoDevice is linked terminates, the file will be
closed and the process itself will be terminated.
An IoDevice returned from this call can be used as an
argument to the IO functions (see
io(3)).
![]() |
In previous versions of |
Typical error reasons:
enoent
eacces
eisdir
enotdir
enoent is returned instead.enospc
write
access was specified).path_consult(Path, Filename) -> {ok, Terms, FullName}
| {error, Reason}
Types:
Path = [Dir]
Dir = name()
Filename = name()
Terms = [term()]
FullName = string()
Reason = posix() | {Line, Mod, Term}
Line, Mod, Term -- see below
Searches the path Path (a list of directory names)
until the file Filename is found. If Filename
is an absolute filename, Path is ignored.
Then reads Erlang terms, separated by '.', from the file.
Returns one of the following:
{ok, Terms, FullName}
FullName is
the full name of the file.{error, enoent}
Path.{error, Posix}
{error, {Line, Mod, Term}}
format_error/1 to convert
the three-element tuple to an English description of
the error.path_eval(Path, Filename) -> {ok, FullName}
| {error, Reason}
Types:
Path = [Dir]
Dir = name()
Filename = name()
FullName = string()
Reason = posix() | {Line, Mod, Term}
Line, Mod, Term -- see below
Searches the path Path (a list of directory names)
until the file Filename is found. If Filename
is an absolute file name, Path is ignored. Then reads
and evaluates Erlang expressions, separated by '.' (or ',', a
sequence of expressions is also an expression), from the file.
The actual result of evaluation is not returned; any
expression sequence in the file must be there for its side
effect. Returns one of the following:
{ok, FullName}
FullName is
the full name of the file.{error, enoent}
Path.{error, Posix}
{error, {Line, Mod, Term}}
format_error/1 to
convert the three-element tuple to an English description
of the error.path_open(Path, Filename, Modes) -> {ok, IoDevice, FullName}
| {error, Reason}
Types:
Path = [Dir]
Dir = name()
Filename = name()
Modes = [Mode] -- see open/2
IoDevice = io_device()
FullName = string()
Reason = posix()
Searches the path Path (a list of directory names)
until the file Filename is found. If Filename
is an absolute file name, Path is ignored.
Then opens the file in the mode determined by Modes.
Returns one of the following:
{ok, IoDevice, FullName}
IoDevice is a reference to the file and
FullName is the full name of the file.{error, enoent}
Path.{error, Posix}
path_script(Path, Filename) -> {ok, Value, FullName}
| {error, Reason}
Types:
Path = [Dir]
Dir = name()
Filename = name()
Value = term()
FullName = string()
Reason = posix() | {Line, Mod, Term}
Line, Mod, Term -- see below
Searches the path Path (a list of directory names)
until the file Filename is found. If Filename
is an absolute file name, Path is ignored. Then reads
and evalutates Erlang expressions, separated by '.' (or ',', a
sequence of expressions is also an expression), from the file.
Returns one of the following:
{ok, Value, FullName}
FullName is
the full name of the file and Value the value of
the last expression.{error, enoent}
Path.{error, Posix}
{error, {Line, Mod, Term}}
format_error/1 to
convert the three-element tuple to an English description
of the error.path_script(Path, Filename, Bindings) ->
{ok, Value, FullName} | {error, Reason}
Types:
Path = [Dir]
Dir = name()
Filename = name()
Bindings -- see erl_eval(3)
Value = term()
FullName = string()
Reason = posix() | {Line, Mod, Term}
Line, Mod, Term -- see path_script/2
The same as path_script/2 but the variable bindings
Bindings are used in the evaluation. See
erl_eval(3) about
variable bindings.
pid2name(Pid) -> string() | undefined
Types:
Pid = pid()
If Pid is an IO device, that is, a pid returned from
open/2, this function returns the filename, or rather:
{ok, Filename}
Pid must be a local pid) and the file is not
closed. Filename is the filename in flat string
format.undefined
![]() |
This function is intended for debugging only. |
position(IoDevice, Location) -> {ok, NewPosition}
| {error, Reason}
Types:
IoDevice = io_device()
Location = Offset | {bof, Offset} | {cur, Offset}
| {eof, Offset} | bof | cur | eof
Offset = int()
NewPosition = int()
Reason = posix()
Sets the position of the file referenced by IoDevice
to Location. Returns {ok, NewPosition} (as
absolute offset) if successful, otherwise
{error, Reason}. Location is one of
the following:
Offset
{bof, Offset}.{bof, Offset}
{cur, Offset}
{eof, Offset}
bof | cur | eof
Offset 0.Typical error reasons are:
einval
Location was illegal, or it evaluated to a
negative offset in the file. Note that if the resulting
position is a negative value, the result is an error, and
after the call the file position is undefined.pread(IoDevice, LocNums) -> {ok, DataL} | {error, Reason}
Types:
IoDevice = io_device()
LocNums = [{Location, Number}]
Location -- see position/2
Number = int()
DataL = [Data]
Data = [char()] | binary() | eof
Reason = posix()
Performs a sequence of pread/3 in one operation,
which is more efficient than calling them one at a time.
Returns {ok, [Data, ...]} or {error, Reason},
where each Data, the result of the corresponding
pread, is either a list or a binary depending on
the mode of the file, or eof if the requested position
was beyond end of file.
pread(IoDevice, Location, Number) -> {ok, Data}
| {error, Reason}
Types:
IoDevice = io_device()
Location -- see position/2
Number = int()
Data = [char()] | binary() | eof
Reason = posix()
Combines position/2 and read/2 in one
operation, which is more efficient than calling them one at a
time. If IoDevice has been opened in raw mode, some
restrictions apply: Location is only allowed to be an
integer; and the current position of the file is undefined
after the operation.
pwrite(IoDevice, LocBytes) -> ok | {error, {N, Reason}}
Types:
IoDevice = io_device()
LocBytes = [{Location, Bytes}]
Location -- see position/2
Bytes = iodata()
N = int()
Reason = posix()
Performs a sequence of pwrite/3 in one operation,
which is more efficient than calling them one at a time.
Returns ok or {error, {N, Reason}}, where
N is the number of successful writes that was done
before the failure.
pwrite(IoDevice, Location, Bytes) -> ok | {error, Reason}
Types:
IoDevice = io_device()
Location -- see position/2
Bytes = iodata()
Reason = posix()
Combines position/2 and write/2 in one
operation, which is more efficient than calling them one at a
time. If IoDevice has been opened in raw mode, some
restrictions apply: Location is only allowed to be an
integer; and the current position of the file is undefined
after the operation.
read(IoDevice, Number) -> {ok, Data} | eof | {error, Reason}
Types:
IoDevice = io_device()
Number = int()
Data = [char()] | binary()
Reason = posix()
Reads Number bytes from the file referenced by
IoDevice. This function is the only way to read from a
file opened in raw mode (although it works for normally
opened files, too). Returns:
{ok, Data}
eof
Number>0 and end of file was reached
before anything at all could be read.{error, Reason}
Typical error reasons:
ebadf
read_file(Filename) -> {ok, Binary} | {error, Reason}
Types:
Filename = name()
Binary = binary()
Reason = posix()
Returns {ok, Binary}, where Binary is a binary
data object that contains the contents of Filename, or
{error, Reason} if an error occurs.
Typical error reasons:
enoent
eacces
eisdir
enotdir
enoent is returned instead.enomem
read_file_info(Filename) -> {ok, FileInfo} | {error, Reason}
Types:
Filename = name()
FileInfo = #file_info{}
Reason = posix()
Retrieves information about a file. Returns
{ok, FileInfo} if successful, otherwise
{error, Reason}. FileInfo is a record
file_info, defined in the Kernel include file
file.hrl. Include the following directive in the module
from which the function is called:
-include_lib("kernel/include/file.hrl").
The record file_info contains the following fields.
size = int()
type = device | directory | regular | other
access = read | write | read_write | none
atime = time()
mtime = time()
ctime = time()
mode = int()
links = int()
major_device = int()
minor_device = int()
inode = int()
inode number. On non-Unix file systems,
this field will be zero.uid = int()
gid = int()
Typical error reasons:
eacces
enoent
enotdir
enoent is returned instead.read_link(Name) -> {ok, Filename} | {error, Reason}
Types:
Name = name()
Filename = string()
Reason = posix()
This function returns {ok, Filename} if Name
refers to a symbolic link or {error, Reason} otherwise.
On platforms that do not support symbolic links, the return
value will be {error,enotsup}.
Typical error reasons:
einval
Linkname does not refer to a symbolic link.enoent
enotsup
read_link_info(Name) -> {ok, FileInfo} | {error, Reason}
Types:
Name = name()
FileInfo = #file_info{}, see read_file_info/1
Reason = posix()
This function works like read_file_info/1, except that
if Name is a symbolic link, information about the link
will be returned in the file_info record and
the type field of the record will be set to
symlink.
If Name is not a symbolic link, this function returns
exactly the same result as read_file_info/1.
On platforms that do not support symbolic links, this function
is always equvivalent to read_file_info/1.
rename(Source, Destination) -> ok | {error, Reason}
Types:
Source = Destination = name()
Reason = posix()
Tries to rename the file Source to Destination.
It can be used to move files (and directories) between
directories, but it is not sufficient to specify
the destination only. The destination file name must also be
specified. For example, if bar is a normal file and
foo and baz are directories,
rename("foo/bar", "baz") returns an error, but
rename("foo/bar", "baz/bar") succeeds. Returns
ok if it is successful.
![]() |
Renaming of open files is not allowed on most platforms
(see |
Typical error reasons:
eacces
Source or Destination. On
some platforms, this error is given if either
Source or Destination is open.eexist
Destination is not an empty directory. On some
platforms, also given when Source and
Destination are not of the same type.einval
Source is a root directory, or Destination
is a sub-directory of Source.eisdir
Destination is a directory, but Source is
not.enoent
Source does not exist.enotdir
Source is a directory, but Destination is
not.exdev
Source and Destination are on different
file systems.script(Filename) -> {ok, Value} | {error, Reason}
Types:
Filename = name()
Value = term()
Reason = posix() | {Line, Mod, Term}
Line, Mod, Term -- see below
Reads and evaluates Erlang expressions, separated by '.' (or ',', a sequence of expressions is also an expression), from the file. Returns one of the following:
{ok, Value}
Value is
the value of the last expression.{error, Posix}
{error, {Line, Mod, Term}}
format_error/1 to
convert the three-element tuple to an English description
of the error.script(Filename, Bindings) -> {ok, Value} | {error, Reason}
Types:
Filename = name()
Bindings -- see erl_eval(3)
Value = term()
Reason = posix() | {Line, Mod, Term}
Line, Mod, Term -- see below
The same as script/1 but the variable bindings
Bindings are used in the evaluation. See
erl_eval(3) about
variable bindings.
Types:
Dir = name()
Sets the current working directory of the file server to
Dir. Returns ok if successful.
Typical error reasons are:
enoent
enotdir
Dir is not a directory. On some
platforms, enoent is returned.eacces
sync(IoDevice) -> ok | {error, Reason}
Types:
IoDevice = io_device()
Reason = posix()
Makes sure that any buffers kept by the operating system (not by the Erlang runtime system) are written to disk. On some platforms, this function might have no effect.
Typical error reasons are:
enospc
truncate(IoDevice) -> ok | {error, Reason}
Types:
IoDevice = io_device()
Reason = posix()
Truncates the file referenced by IoDevice at
the current position. Returns ok if successful,
otherwise {error, Reason}.
write(IoDevice, Bytes) -> ok | {error, Reason}
Types:
IoDevice = io_device()
Bytes = iodata()
Reason = posix()
Writes Bytes to the file referenced by
IoDevice. This function is the only way to write to a
file opened in raw mode (although it works for normally
opened files, too). Returns ok if successful, and
{error, Reason} otherwise.
Typical error reasons are:
ebadf
enospc
write_file(Filename, Binary) -> ok | {error, Reason}
Types:
Filename = name()
Binary = binary()
Reason = posix()
Writes the contents of the binary data object Binary
to the file Filename. The file is created if it does
not exist. If it exists, the previous contents are
overwritten. Returns ok, or {error, Reason}.
Typical error reasons are:
enoent
enotdir
enoent is returned instead.enospc
eacces
eisdir
write_file(Filename, Binary, Modes) -> ok | {error, Reason}
Types:
Filename = name()
Binary = binary()
Modes = [Mode] -- see open/2
Reason = posix()
Same as write_file/2, but takes a third argument
Modes, a list of possible modes, see
open/2. The mode flags
binary and write are implicit, so they should
not be used.
write_file_info(Filename, FileInfo) -> ok | {error, Reason}
Types:
Filename = name()
FileInfo = #file_info{} -- see also read_file_info/1
Reason = posix()
Change file information. Returns ok if successful,
otherwise {error, Reason}. FileInfo is a record
file_info, defined in the Kernel include file
file.hrl. Include the following directive in the module
from which the function is called:
-include_lib("kernel/include/file.hrl").
The following fields are used from the record, if they are given.
atime = time()
mtime = time()
ctime = time()
mode = int()
uid = int()
gid = int()
Typical error reasons:
eacces
enoent
enotdir
enoent is returned instead.
eacces - permission denied
eagain - resource temporarily unavailable
ebadf - bad file number
ebusy - file busy
edquot - disk quota exceeded
eexist - file already exists
efault - bad address in system call argument
efbig - file too large
eintr - interrupted system call
einval - invalid argument
eio - IO error
eisdir - illegal operation on a directory
eloop - too many levels of symbolic links
emfile - too many open files
emlink - too many links
enametoolong - file name too long
enfile - file table overflow
enodev - no such device
enoent - no such file or directory
enomem - not enough memory
enospc - no space left on device
enotblk - block device required
enotdir - not a directory
enotsup - operation not supported
enxio - no such device or address
eperm - not owner
epipe - broken pipe
erofs - read-only file system
espipe - invalid seek
esrch - no such process
estale - stale remote file handle
exdev - cross-domain link
Some operating system file operations, for example a
sync/1 or close/1 on a huge file, may block their
calling thread for seconds. If this befalls the emulator main
thread, the response time is no longer in the order of
milliseconds, depending on the definition of "soft" in soft
real-time system.
If the device driver thread pool is active, file operations are done through those threads instead, so the emulator can go on executing Erlang processes. Unfortunately, the time for serving a file operation increases due to the extra scheduling required from the operating system.
If the device driver thread pool is disabled or of size 0, large
file reads and writes are segmented into several smaller, which
enables the emulator so server other processes during the file
operation. This gives the same effect as when using the thread
pool, but with larger overhead. Other file operations, for
example sync/1 or close/1 on a huge file, still are
a problem.
For increased performance, raw files are recommended. Raw files uses the file system of the node's host machine. For normal files (non-raw), the file server is used to find the files, and if the node is running its file server as slave to another node's, and the other node runs on some other host machine, they may have different file systems. This is seldom a problem, but you have now been warned.
A normal file is really a process so it can be used as an IO
device (see io). Therefore when data is written to a
normal file, the sending of the data to the file process, copies
all data that are not binaries. Opening the file in binary mode
and writing binaries is therefore recommended. If the file is
opened on another node, or if the file server runs as slave to
another node's, also binaries are copied.
Caching data to reduce the number of file operations, or rather the number of calls to the file driver, will generally increase performance. The following function writes 4 MBytes in 23 seconds when tested:
create_file_slow(Name, N) when integer(N), N >= 0 ->
{ok, FD} = file:open(Name, [raw, write, delayed_write, binary]),
ok = create_file_slow(FD, 0, N),
ok = ?FILE_MODULE:close(FD),
ok.
create_file_slow(FD, M, M) ->
ok;
create_file_slow(FD, M, N) ->
ok = file:write(FD, <<M:32/unsigned>>),
create_file_slow(FD, M+1, N).
The following, functionally equivalent, function collects 1024
entries into a list of 128 32-byte binaries before each call to
file:write/2 and so does the same work in 0.52 seconds,
which is 44 times faster.
create_file(Name, N) when integer(N), N >= 0 ->
{ok, FD} = file:open(Name, [raw, write, delayed_write, binary]),
ok = create_file(FD, 0, N),
ok = ?FILE_MODULE:close(FD),
ok.
create_file(FD, M, M) ->
ok;
create_file(FD, M, N) when M + 1024 =< N ->
create_file(FD, M, M + 1024, []),
create_file(FD, M + 1024, N);
create_file(FD, M, N) ->
create_file(FD, M, N, []).
create_file(FD, M, M, R) ->
ok = file:write(FD, R);
create_file(FD, M, N0, R) when M + 8 =< N0 ->
N1 = N0-1, N2 = N0-2, N3 = N0-3, N4 = N0-4,
N5 = N0-5, N6 = N0-6, N7 = N0-7, N8 = N0-8,
create_file(FD, M, N8,
[<<N8:32/unsigned, N7:32/unsigned,
N6:32/unsigned, N5:32/unsigned,
N4:32/unsigned, N3:32/unsigned,
N2:32/unsigned, N1:32/unsigned>> | R]);
create_file(FD, M, N0, R) ->
N1 = N0-1,
create_file(FD, M, N1, [<<N1:32/unsigned>> | R]).
![]() |
Trust only your own benchmarks. If the list length in
If the size of each binary is increased to 64 bytes, it will also run slightly faster, but the code will be twice as clumsy. In the current implementation are binaries larger than 64 bytes stored in memory common to all processes and not copied when sent between processes, while these smaller binaries are stored on the process heap and copied when sent like any other term. So, with a binary size of 68 bytes |
A raw file is really a port. When writing data to a port, it is
efficient to write a list of binaries. There is no need to
flatten a deep list before writing. On Unix hosts, scatter output,
which writes a set of buffers in one operation, is used when
possible. In this way file:write(FD, [Bin1, Bin2 | Bin3])
will write the contents of the binaries without copying the data
at all except for perhaps deep down in the operating system
kernel.
For raw files, pwrite/2 and pread/2 are
efficiently implemented. The file driver is called only once for
the whole operation, and the list iteration is done in the file
driver.
The options delayed_write and read_ahead to
file:open/2 makes the file driver cache data to reduce
the number of operating system calls. The function
create_file/2 in the example above takes 60 seconds
seconds without the delayed_write option, which is 2.6
times slower.
And, as a really bad example, create_file_slow/2 above
without the raw, binary and delayed_write
options, that is it calls file:open(Name, [write]), needs
1 min 20 seconds for the job, which is 3.5 times slower than
the first example, and 150 times slower than the optimized
create_file/2.
If an error occurs when accessing an open file with the io
module, the process which handles the file will exit. The dead
file process might hang if a process tries to access it later.
This will be fixed in a future release.