Reading a file in reverse using boost::iostreams - Boost-users - lists.stage.boost.cppalliance.org

Sean Farrow

12 Feb 2019

4:33 p.m.

Hi,

Is there an easy way to read a file in reverse using boost::iostreams? I've got a case where I need to detect whether text is present and it's closer to the end of the file than the beginning.

Any help appreciated.

Kind regards
Sean.


Craig Henderson

12 Feb 2019

6:30 p.m.

Reading in reverse is likely to be much slower because of the buffering, so I doubt it will give you the performance gain you seem to be looking for.

Craig


Gavin Lambert

11:08 p.m.

On 13/02/2019 05:33, Sean Farrow wrote:

> Is there an easy way to read a file in reverse using boost::iostreams?
>
> I’ve got a case where I need to detect whether text is present and it’s closer to the end of the file than the beginning.

You should be able to read the length of the stream, then seek to a position near the end and read forwards from there. Of course, you need to know a suitable value to use as the range where you expect the value to be present; if you get this wrong then you'll either have a false negative or you'll waste a bit more time jumping back further and trying again.
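The seek-and-read-forwards approach described above could be sketched roughly as follows. This is a minimal illustration, not code posted in the thread: the helper names `found_near_end` and `found_near_end_of_file` and the `window` parameter are hypothetical, and the sketch uses plain `std::istream` and `std::ifstream`. Since `boost::iostreams::stream` derives from the standard stream classes, the first function should accept one as well, assuming the underlying device is seekable.

```cpp
#include <cassert>
#include <cstddef>
#include <fstream>
#include <istream>
#include <string>

// Hypothetical helper: true if `needle` occurs within the last `window`
// bytes of a seekable stream. Open the stream in binary mode so that
// offsets correspond to bytes.
bool found_near_end(std::istream& in, const std::string& needle, std::size_t window)
{
    assert(!needle.empty());

    in.clear();                                  // drop any stale eof/fail state
    in.seekg(0, std::ios::end);
    const std::streamoff length = in.tellg();    // -1 if the stream cannot seek
    if (length < 0) return false;

    // Start `window` bytes before the end, or at the beginning for short files.
    std::streamoff start = 0;
    if (static_cast<unsigned long long>(length) > window)
        start = length - static_cast<std::streamoff>(window);
    in.seekg(start, std::ios::beg);

    // Read forwards from `start` to the end and search the tail normally.
    std::string tail(static_cast<std::size_t>(length - start), '\0');
    in.read(&tail[0], static_cast<std::streamsize>(tail.size()));
    tail.resize(static_cast<std::size_t>(in.gcount()));

    return tail.find(needle) != std::string::npos;
}

// Hypothetical convenience wrapper for a file on disk.
bool found_near_end_of_file(const std::string& path, const std::string& needle,
                            std::size_t window)
{
    std::ifstream in(path, std::ios::binary);
    return in && found_near_end(in, needle, window);
}
```

On a miss, a caller could retry with a larger `window`; that retry is the extra cost of guessing the range too small.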


Richard Hodges

13 Feb 2019

4:43 a.m.

My solution would be:

1. memory map the file (either use boost.interprocess or trivially hand-roll a few OS calls)
2. build an iterator pair (i.e. char *) representing the extent of the mapped memory
3. call std::make_reverse_iterator on the iterator pair
4. use a standard algorithm
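Steps 2 to 4 of this approach might look something like the sketch below. The mapping step itself is left out: the function simply takes a `const char*` pair, which is assumed to come from whatever mapping is in use (for example `data()` and `data() + size()` of a `boost::iostreams::mapped_file_source`). The name `last_offset_of` is hypothetical. Note that the needle is also traversed in reverse, because the haystack is.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <iterator>
#include <string>

// Hypothetical helper: offset of the last occurrence of `needle` in
// [first, last), or -1 if it does not occur. The pointer pair is assumed
// to delimit the mapped file contents.
std::ptrdiff_t last_offset_of(const char* first, const char* last,
                              const std::string& needle)
{
    assert(first <= last);
    if (needle.empty()) return -1;

    // Reverse views of the range; std::make_reverse_iterator (C++14)
    // would build the same iterators.
    const std::reverse_iterator<const char*> rfirst(last);
    const std::reverse_iterator<const char*> rlast(first);

    // Search backwards. The pattern has to be reversed as well, so that
    // the first hit in reverse order is the last occurrence in the file.
    const auto hit = std::search(rfirst, rlast, needle.rbegin(), needle.rend());
    if (hit == rlast) return -1;

    // hit.base() points one past the final character of the match.
    return (hit.base() - first) - static_cast<std::ptrdiff_t>(needle.size());
}
```

Whether this touches only the tail of the file depends on how the operating system pages the mapping in, so it should be read as an illustration of the iterator mechanics rather than a performance claim.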


Niall Douglas

8:15 p.m.

On 13/02/2019 04:43, Richard Hodges via Boost-users wrote:

> My solution would be:
>
> 1. memory map the file (either use boost.interprocess or trivially hand-roll a few OS calls)
> 2. build an iterator pair (i.e. char *) representing the extent of the mapped memory
> 3. call std::make_reverse_iterator on the iterator pair
> 4. use a standard algorithm

Unless the file is warm cached, this will be slow. I know of no kernel which performs readbehind, only readahead. It is safer to do as Gavin suggests: jump to some offset back from the maximum extent and read forwards.

Niall


Gavin Lambert

10:32 p.m.

On 14/02/2019 09:15, Niall Douglas wrote:

> On 13/02/2019 04:43, Richard Hodges wrote:
>> My solution would be:
>>
>> 1. memory map the file (either use boost.interprocess or trivially hand-roll a few OS calls)
>> 2. build an iterator pair (i.e. char *) representing the extent of the mapped memory
>> 3. call std::make_reverse_iterator on the iterator pair
>> 4. use a standard algorithm
>
> Unless the file is warm cached, this will be slow. I know of no kernel which performs readbehind, only readahead.

I imagine what would happen is that it would either read the whole file into memory at once (presumably only if it's small) -- which would then be fast to iterate, but not really any better than just reading it normally -- or it would reserve pages and then when you started reading from the end it would commit a page or two read from the end of the file, so it would be reasonably fast reading forwards or backwards after that until you cross a page boundary. So it may not be a problem if your target is within the last 4kB or so of the file.

Having said that, by definition this can't really be any faster than doing what I suggested (modulo some issues with page sizes and alignments). And if you use a reverse iterator it also requires you to recognise your search pattern in reverse as well, which is usually inconvenient.
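To make the page-size and alignment caveat concrete, here is a small arithmetic sketch of mapping only the tail of a file. Mapping offsets generally have to be multiples of some alignment, so the start of the desired window is rounded down and the difference is skipped after mapping. The `TailMapping` struct, the `tail_mapping` function and the 4096-byte page size used in the usage note are all illustrative assumptions; real code would query the alignment from the OS or from the mapping library.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical description of an aligned mapping covering the file's tail.
struct TailMapping {
    std::uint64_t offset;  // aligned file offset at which to start the mapping
    std::uint64_t length;  // number of bytes to map, up to the end of the file
    std::uint64_t skip;    // bytes to skip inside the mapping to reach the window
};

// Computes where to map so that the last `window` bytes of a file of
// `file_size` bytes are covered, given a required offset alignment.
TailMapping tail_mapping(std::uint64_t file_size, std::uint64_t window,
                         std::uint64_t page_size)
{
    assert(page_size > 0);

    // Unaligned start of the window, clamped to the start of the file.
    const std::uint64_t wanted = window < file_size ? file_size - window : 0;

    // Round down to the nearest multiple of the page size.
    const std::uint64_t offset = wanted - wanted % page_size;

    return { offset, file_size - offset, wanted - offset };
}
```

For a 10000-byte file, a 100-byte window and an assumed 4096-byte page, this gives an offset of 8192, a mapped length of 1808 and a skip of 1708.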
