pprof

Author	SHA1	Message	Date
Raul Silvera	8ef477bf90	Improve support for the Comments field By default hide entries in the comment field that starts with #, and add a "comments" command to show all comments.	8 years ago
Raul Silvera	8f46b46449	Replace insns with insts for list of assembly instructions	8 years ago
Raul Silvera	4453e7e1b3	Add source information to disassembly reports Disassembly reports generated by pprof -disasm will now include line number information as generated by objdump. This will make the generated assembly more readable. As part of this I've introduced a new assemblyInstruction struct. Previously the code was reusing the graph.Node to represent assembly instructions but it seems better to have a dedicated type for this.	8 years ago
Raul Silvera	9bd8312bd7	Have unit= apply to numeric tags Format numeric tag values using the value of the -unit flag. Added a testcase for this behavior.	8 years ago
Raul Silvera	fbfe4810f1	Upstream minir changes from golang version of pprof - Remove dead code, unnecessary conversions - Consistently use a single space after period	8 years ago
Raul Silvera	e10c124884	Fix mean computation during graph creation When -mean is selected, currently pprof divides the sample value by value[0], which is expected to be the number of samples. This is intended to produce mean value per sample. These means cannot be added. Instead, we should add the value and the number of samples independently and perform the division at the end. To do this we will create a separate function to get the number of samples, and accumulate it independently from the sample value (weigth) and apply the division after the accumulation is completed.	8 years ago
Michael Pratt	77bbb10981	Instruction-level granularity in callgrind output When generating callgrind format output, produce cost lines at instruction granularity. This allows visualizers supporting the callgrind format to display instruction-level profiling information. We also need to provide the object file (ob=) in order for tools to find the object file to disassemble when displaying assembly. We opportunistically group cost lines corressponding to the same function together, reducing the number of superfluous description lines. Subposition compression (relative position numbering) is also used to reduce the output size.	9 years ago
Raul Silvera	68b2231948	Ignore system name when merging nodes This will allow merging nodes for multiple instantiations of C++ templates. A graph option is introduced to preserve older behavior.	9 years ago
Raul Silvera	81cfe92f9b	Disambiguate names for kcachegrind under the call_tree option When using the call_tree option and generating a graph for kcachegrind, it will merge back nodes that are distinct on the tree, producing some confusing results. Add a suffix so that these entries are kept separate. This addresses the problem described in http://yosefk.com/blog/how-profilers-lie-the-cases-of-gprof-and-kcachegrind.html , particularly the summary "Choosing a profiler is hard" section.	9 years ago
Wade Simba Khadder	7c0b6f60fd	Regresses NodeSet interface This is to keep the new TrimTree functionality from breaking any code currently using the public interface. We do this by separating NodeSet from nodePtrSet and creating different functions for each.	9 years ago
Wade Simba Khadder	928d30ac27	Makes TrimTree have no return type	9 years ago
Wade Simba Khadder	d51fda846e	Adds correct trimming with heuristics for trees	9 years ago
Wade Simba Khadder	bee282c062	Allow trees to trim to the top Nodes by NodeInfo In a graph, NodeInfo maps one to one to the nodes, so it suffices to just find the top NodeInfo s and only keep those nodes in the graph. In a tree however, a single NodeInfo may map to many nodes. As of this commit, a call to 'web 10' in pprof on a tree will return all the nodes corresponding with the top 10 NodeInfo s.	9 years ago
Wade Simba Khadder	8b643ccdaa	Fixs tests for updated report label	9 years ago
Wade Simba Khadder	f213286a00	Removes the cumulative barrier from the label It is no longer representative of how omissions are made. pprof uses more than just the cumulative value to make omissions.	9 years ago
Wade Simba Khadder	05fe768186	Makes Report Label use the actual node count Before this change, the node count used in the label is the proposed amount provided by the user. If some nodes were trimmed and the graph ended up with less nodes than the user asked for, the report label will now reflect this.	9 years ago
Raul Silvera	303a27381d	Add source_path option to point pprof to source files Currently pprof will look for source files only on the current directory and its parents. This makes it hard to examine sources on jobs where there are multiple source trees (eg from different libraries). Add a variable to provide a search path for source files. It will default to the cwd, so there will be no change in behavior by default.	9 years ago
Raul Silvera	12a41b40ad	Revert "Add mechanism to reduce profile size" This reverts commit 9c191ebbce. The trimproto option is no longer needed, removing it to avoid code growth.	9 years ago
Raul Silvera	198d319c7d	Ensure pprof call_tree output is a tree When generating a call tree, pprof was using a map to keep track of all the inline nodes for a location. That is incorrect as it may cause inline functions at different nesting levels to reuse the same node, causing the resulting graph to not be a tree. When creating a tree nodes with the same info may appear on multiple places in the tree. Keeping one of them preserves them all, which may cause disconnected nodes to remain. To ensure the resulting graph is a connected tree, do not include children on any removed node, which is suitable for the normal tree refinement (nodecount and nodefraction) but does not allow visual refinement, which may eliminate intermediate nodes. Disable visual mode refinement for call_tree to avoid this issue.	9 years ago
Raul Silvera	cf9a1f462e	Add diagnostics for symbolization without mapping filenames	9 years ago
Raul Silvera	9c191ebbce	Add mechanism to reduce profile size Add new trimproto option that generates a new profile while removing symbol information for functions below nodefraction. This reduces the profile sizes significantly.	9 years ago
Raul Silvera	e5e7096d4f	Speed up graph creation Separate implementation of graph and tree creation to speed it up. Graph implementation maps upfront all locations to sequences of nodes, tree implementation uses a per-parent map to keep track of a different node per location per parent.	9 years ago
Raul Silvera	164ad05478	Update tree structure to follow Go conventions Also update the build instructions to use go get	9 years ago
Raul Silvera	d174bbe741	first commit of OSS pprof	9 years ago

23 Commits (8ef477bf903ac8ff618ac07c4b00a8e9fc34585b)