
Expose pt_image_add? #37

Open
vext01 opened this issue Mar 6, 2018 · 10 comments

vext01 commented Mar 6, 2018

Hi Markus,

Is there a way to load code into the image from a memory address of the current process?

It looks like pt_image_add might be what I want, but this is not exposed in libipt.

I'm trying to avoid having to dump the VDSO to file, just to immediately load it back in.

Come to think of it, since my process is tracing itself, I could load all of the code sections from memory and avoid filesystem accesses entirely.

Thanks

@markus-metzger

Hello Edd,

you could register a read memory callback in struct pt_config. The decoder will call you for all addresses it cannot find in its image.

That won't work together with the block cache, though, since the cache is organized per image section. Decoding will be significantly slower without it.

We already support mmap()-based and fread()-based sections. We could add a third type for in-memory sections. But I would not do it without good reason. What's your motivation?

@ck-on-github

I guess adding support for in-memory sections with the block cache would also be beneficial for JITted code. It would eliminate the need for dumping the code to files just for PT decoding purposes.

vext01 commented Mar 6, 2018

> We could add a third type for in-memory sections. But I would not do it without good reason. What's your motivation?

I'm writing a tracing JIT using PT for the trace collection component.

I'll have a profiling interpreter for some language which decides which parts of a user program are frequently executed. Once a location becomes "hot" I'll collect a PT trace, decode it, optimise it, and compile a trace for later executions of the same location.

Under this scenario, the traced and the tracing process are the same (the process traces itself), and all of the code needed to decode the trace is already in virtual memory. Ideally I'd just point libipt at the memory containing the code rather than reading anything from disk.

@markus-metzger

You're also decoding in-process?

If I understood correctly, you'd just want to generate a single section spanning the entire address space. I see how this would be more convenient. But it would require a different organization of the block cache. I'd also expect the lookup to be slower.

If you're willing to create individual sections matching your code layout, we could keep the current block cache organization. But I'm not sure we really gain a lot by not dumping the memory into files first. Of course, this is extra work, but how much is it really compared to decode and other overhead?

Adding an in-memory section type shouldn't be too difficult but it would still be a pity if we didn't gain anything.

Another aspect is self-modifying code. If the JITer is going to overwrite older versions of a JITed function (or otherwise re-use the memory), we'd have to dump them into files, anyway, unless we can make sure that they won't appear in any trace anymore.

vext01 commented Mar 7, 2018

Hi Markus,

Yes, I'm decoding in-process.

Ideally I'd be able to tell libipt that all code comes from the current virtual address space, but I don't mind loading the individual sections into the image if that fits better with the architecture you already have.

As for tracing JITted code: in my use case that will never happen. We will only trace code that was statically compiled.

@markus-metzger

What improvements do you expect from an in-memory section? Have you profiled the code?

vext01 commented Mar 7, 2018

The code is still being written, but I'll take it as a given that reading from memory is going to be faster than reading from disk.

Also, under the current API, I have to dump the VDSO to disk and then have libipt read it back in, which seems odd and inefficient to me.

vext01 commented Mar 21, 2018

What did you think of this, Markus? Is my use case too niche for inclusion in libipt?

@markus-metzger

I'm hesitating because it is not clear to me that this will result in a noticeable performance improvement.

vext01 commented Mar 27, 2018

Having thought about this some more, I think the performance improvement would only be visible when loading the image, which is a one-time event for most use cases.

I suppose I should have been spinning this as a usability improvement. It's a bit awkward having to dump the VDSO just to load it back in later.

I've also noticed that libipt lazily reads from the VDSO file on demand during decoding. Because I'm using Rust (and I know this is my problem, not yours!) I have to pass the file handle around for the sole purpose of keeping the temporary file alive long enough: as soon as the handle falls out of scope, the file is deleted.
