Skip to main content

Generation and validation of a formula to calculate hemoglobin loss on a cohort of healthy adults subjected to controlled blood loss



The ability to approximate intra-operative hemoglobin loss with reasonable precision and linearity is prerequisite for determination of a relevant surgical outcome parameter: This information enables comparison of surgical procedures between different techniques, surgeons or hospitals, and supports anticipation of transfusion needs. Different formulas have been proposed, but none of them were validated for accuracy, precision and linearity against a cohort with precisely measured hemoglobin loss and, possibly for that reason, neither has established itself as gold standard. We sought to identify the minimal dataset needed to generate reasonably precise and accurate hemoglobin loss prediction tools and to derive and validate an estimation formula.


Routinely available clinical and laboratory data from a cohort of 401 healthy individuals with controlled hemoglobin loss between 29 and 233 g were extracted from medical charts. Supervised learning algorithms were applied to identify a minimal data set and to generate and validate a formula for calculation of hemoglobin loss.


Of the classical supervised learning algorithms applied, the linear and Ridge regression models performed at least as well as the more complex models. Most straightforward to analyze and check for robustness, we proceeded with linear regression. Weight, height, sex and hemoglobin concentration before and on the morning after the intervention were sufficient to generate a formula for estimation of hemoglobin loss. The resulting model yields an outstanding R2 of 53.2% with similar precision throughout the entire range of volumes or donor sizes, thereby meaningfully outperforming previously proposed medical models.


The resulting formula will allow objective benchmarking of surgical blood loss, enabling informed decision making as to the need for pre-operative type-and-cross only vs. reservation of packed red cell units, depending on a patient’s anemia tolerance, and thus contributing to resource management.


Blood, or, more precisely, hemoglobin (Hb) loss during surgery is a risk factor for the requirement of a red blood cell (RBC) transfusion; blood loss is also a quality indicator for medical or surgical procedures [1,2,3,4,5,6,7,8,9,10,11,12]. Surgical (i.e. functional) success of a procedure provided, hemoglobin loss is thus one of the most salient outcome parameters of the quality of an intervention, whether for comparison of competing techniques, of hospitals, or of individual surgeons. Once the typical blood loss during a certain intervention has been established with some robustness, use of that value and a specific patient’s predicted individual anemia tolerance can be used to make decisions with respect to perioperative anemia management–will type and cross be required, will it be sufficient, or should RBC products be immediately available in the operating room–and can thus aid blood bank inventory management. While hemoglobin management and hemoglobin loss as a critical quality-defining outcome parameter are firmly established, useful, practical tools for assessment of intra-operative hemoglobin loss are lacking [13, 14]. Methods to gauge hemoglobin loss into surgical gauze, using photometric methods, were already proposed in the 1950s [15]. A number of simple formulas drawing on pre- and post-operative hemoglobin or hematocrit (Hct), typically incorporating total blood volume/RBC volume/ hemoglobin mass of the respective patient, have been proposed since [16,17,18,19,20,21,22]. None of these caught on, presumably because the former is cumbersome and the latter poorly validated, and a need to provide techniques for more accurate blood loss quantification was identified [14, 23]. The ascent of “big data” mathematical brute-force technologies as well as the ubiquity of handheld minicomputers which can accommodate complex estimation algorithms now allowed development of a refined formula for hemoglobin loss based on a large cohort of healthy individuals subjected to controlled hemoglobin loss. We identify a minimum dataset required for intraoperative hemoglobin loss calculation across a wide range equivalent to 0.5–4.5 RBC units, demonstrate insufficiency of established formulas with respect to accuracy and linearity, and propose a mathematically relatively complex new formula which is easily applied by entering the minimal data into a software application (web-app) or tabular calculation datasheet.

Donors and methods

The subjects of the controlled bleed were healthy allogeneic bone marrow donors. All had undergone extensive health assessment according to criteria exceeding donor assessment guidelines of JACIE and WMDA as well as European and national law as previously described [24] and were found healthy except for minor non-limiting conditions such as arterial hypertension, hyperlipidemia, asthma/allergies or hypothyroidism which were either medically controlled or did not require medication. Under general anesthesia, bone marrow was aspirated as described [25]; the total hemoglobin contained in the bone marrow product represents the “controlled hemoglobin loss”. Target volume was dictated by required stem cell dose, although aspiration volume could be limited by donor weight where the theoretical target volume exceeded 1.5% of body weight. During the marrow harvest donors received crystalline replacement fluid of approximately 1 L/h. Collection of one liter of bone marrow takes approximately 45 min. Complete blood count (CBC) and differential were measured on the day before marrow collection. Bone marrow was aspirated transcutaneously from the posterior iliac crest, i.e. unlike during most surgeries the procedure is not associated with unaccounted blood loss and can therefore be measured quite accurately. Bone marrow volume was gauged by weighing the product with precision scales and correcting for density according to hematocrit. Hemoglobin loss (g) was determined as the hemoglobin concentration (g/L) in the bone marrow product times its volume. After awaking from the anesthesia donors were offered liquids by mouth ad libitum and mobilized immediately. Donors were routinely discharged 24–28 h after the marrow harvest; post-procedure CBC was collected immediately prior to discharge. CBC in blood before and after the harvest and in bone marrow were measured with the same Sysmex XT1800 hemacytometer (Norderstedt, Germany) in our accredited laboratory which participates in quality circles; at least three daily controls ensure precision and accuracy of the measurements. The following data were collected and considered for the analyses: Donor height, weight, age, sex and BMI, CBC before and after in blood, CBC and volume in the product.

Mathematical analyses

The formula was generated by standard machine learning approaches. We strove to employ a broad range of supervised learning algorithms that would stress different aspects of the data, i.e. linear vs. non-linear relationships between independent and predicted variable, parametric vs. non-parametric models. Thus during a model selection phase we considered the following classical supervised learning algorithms.

  • Linear regression [26]

  • Ridge regression with penalty factor α = 5 [27]

  • AdaBoost with decision tree base learner and 100 estimators [28]

  • Random Forest with 40 decision trees each with max depth of 4 [29, 30]

  • Support vector regression with RBF kernel, C = 9 and ε = 0.9 [31, 32]

  • K-nearest neighbors with k = 20 [32, 33]

  • Neural network with 3 hidden layers, rectified linear unit (ReLU) activation function and the adam optimizer [34]

The key idea behind model selection in machine learning is to dispassionately try different models on the raw input data independent of the (medical) meaning of individual values, as opposed to classical statistical analysis with hypothesis-driven construction of relationships which can, by necessity, only reinforce known or suspected relationships. Not until after one or more promising models are identified, these are optimized as best as possible on the given data at which point knowledge about the meaning of the identified values can be integrated. Before running the machine learning algorithms we (partially) log-transformed the data, de-correlated some variables (HB_pre and HB_post) by differentiating them and eventually standard scaled all variables (deduct mean and divide by standard deviation) since some algorithms such as kNN require pre-standardization. Once we had settled on the regression model, we performed the analysis without standardization since such preprocessing is not needed for (most) linear models and we wanted to ensure interpretability of the resulting coefficients. With respect to decision trees, while they are mostly used for classification problems, they can be readily adjusted for regression problems. We typically use information gain (maximum reduction in weighted entropy to decide on the splitting attribute) in classification problems; by contrast, the mean squared error is the metric of choice for regression problems using decision trees. All necessary computations were performed in Python 3.8 by using the scikit learn application programming interface (API) [35, 36]. With respect to the data that proved useful, the approach ended up not being very different than what has been used previously, calculation of Hb loss from Hb dilution in blood, i.e. the result of rapid correction of blood volume after a blood loss. Our data representing a wide range of Hb loss values, this uniquely facilitated modeling the function to the data and thus generating the coveted linearity for our formula.

Ethics statement

This non-interventional retrospective analysis leverages on anonymous outcome data assembled for the legally mandated annual product quality report of our unit. The Ethics Committee of Goethe University Medical School has confirmed that neither specific donor consent nor approval of the Ethics Committee is required.


Donors, blood loss

478 healthy donors were subjected to bone marrow aspiration and data were recorded in the donor database. Data from 77 thereof were missing post-collection CBC. Only complete data sets were used; the controlled bleed cohort thus consisted of 401 adult donors, two-thirds male, with a mean age of 28 years (IQR 24–40 years, max. 60 years), mean height of 178 cm (IQR 170–184 cm), mean weight of 79 kg (IQR 67–89 kg) and mean BMI of 24.7 kg/m2 (IQR 22.3–27.4 kg/m2), i.e. 2%, 33% and 14% were underweight, overweight and obese, respectively. Total hemoglobin in the product ranged from 29–233 g (mean 113 g, IQR 88–141 g), i.e. the equivalent of 0.5–4.5 units of RBCs and thus representing a meaningful range of volumes for many procedures except some relatively bloodless or very large surgeries. No RBC transfusions were administered.

Determination of minimal data sets and development of optimal prediction algorithms:

The algorithm would be considered “optimal” if the least number of most easily available parameters yielded correct values across the entire range of blood loss volumes or donor sizes and inclusion of further values did not meaningfully improve the prediction, e.g. by narrowing its precision.

The raw dataset features information on the weight (kg), the height (cm), the sex, the age (years) and a number of measurements pre- and post-surgery of 401 donors with the corresponding observed HB loss (g) harvested into the BM product bag. To be precise, the pre- and post-surgery measurements encompass the hemoglobin concentration (g/L) (denoted HB_pre and HB_post) and hematocrit concentration (%) (HCT_pre and HCT_post). Moreover, pre-surgery thrombocytes (#/μL) (PLT_pre) and leucocytes (#/μL) (LEUCO_pre) were at our disposal.

We set out to develop a machine learning model to predict blood loss from the raw data by training the algorithm against the observed hemoglobin loss. A standard medical approach towards a problem of this nature would be to employ Nadler’s equation [37] to estimate a patient’s blood volume from the patient’s weight, height and sex. In combination with the difference of HB_pre and HB_post one can obtain an estimate of the hemoglobin loss which is essentially what the Meunier formula suggests [17]. Regressing the actual blood loss against this estimate yields an R2 value of 47.6%. This approach, close to the Meunier formula [17], will serve as our reference model. Since a standard normal distribution of the input variables and only moderate correlations between variables is desirable for certain supervised learning algorithms, we first investigated the distribution of the input variables and considered the correlation matrix. The former analysis suggested that weight, age, BMI and LEUCO_pre needed to be log-transformed. The correlation matrix based on these log-transformations in Fig. 1 indicates a particularly strong correlation between the pre- and post-surgery measurements (81% Pearson correlation coefficient between HB_pre and HB_post, 78% between HCT_pre and HCT_post and 95% and 97% for the pre- and post-surgery Hb and Hct values respectively) and strong correlations between height, weight and sex. We addressed the former correlations by introducing the auxiliary variable HB_diff defined as the difference between HB_pre and HB_post and in exchange dropping the variable HB_post. Hct values were decorrelated from Hb values and a corresponding auxiliary variable HCT_diff introduced. Regarding the latter correlations, we kept the variables as-is, but would pay close attention to potential multicollinearity issues in the resulting regression model. We followed a standard machine learning approach for model selection, that is, we initially split off a test dataset containing 25% of the samples. Subsequently, we performed a fivefold cross-validation grid search on the remaining train dataset for hyperparameter tuning. For the linear regression model, we dropped all variables that were not significant at a 5% threshold on the train dataset in an effort to design a simple model. Moreover, to emulate the relationship between the Hb concentration difference and the blood volume in determining hemoglobin loss we introduced an interaction term between estimated blood volume as calculated by the Nadler formula and HB_diff. Prediction accuracy was measured by the mean squared error (MSE) and the explained variance (EV). With n denoting the sample size, y = (yi)i[n] the vector of true blood loss and ŷ = i)I[n] the predicted blood loss, the formulas for the performance metrics are given by

Fig. 1
figure 1

Correlation matrix: Strength of correlations for variables including Hb loss to evaluate the need for transformation of the input data

$$MSE = \frac{{\sum\limits_{i = 1}^n {{{({{\hat y}_i} - {y_i})}^2}} }}{n} \quad EV = 1 - \frac{{Var(y - \hat y)}}{{Var(y)}}$$

Clearly, the smaller the MSE and the larger the EV, the better is the model.

Table 1 summarizes the MSE and EV for the three best-performing supervised learning models given in the previous section and the medical reference model on the test dataset. The fairly simple regression models performed meaningfully better than more advanced techniques such as neural networks. As the linear regression performed on par with Ridge regression, but is more straightforward to analyze and check for robustness, we proceeded with this model. The linear model yields an R2 of 53.2% thereby meaningfully outperforming the standard medical model. Due to the tractability of the ordinary least-squares-regression model, we were able to put forth simple formulas for women and men to predict hemoglobin loss (coefficients rounded to 3 decimals).

Table 1 Prediction accuracy for different statistical models

(HBfemale/male [hemoglobin loss in g], height [body height in m], weight [body weight in kg], HBpre/post [hemoglobin concentration before/after controlled hemoglobin loss in g/L]). An excel worksheet implementing these formulae is provided as Additional file 1.

All coefficients were significant at a < 0.1% significance level. We performed the standard diagnostic tests to ensure the assumptions of a linear regression model are satisfied and the model is robust. First, the condition number of 1.46 indicated that the model does not suffer from severe multicollinearity. Second, the partial regression and partial residual plots confirmed the linear relationship between the variables and hemoglobin loss. Third, a Bruesch-Pagan test found significant evidence of heteroscedasticity (p-value < 0.1%). To this end, we re-performed the regression model using a heteroscedasticity consistent covariance matrix to obtain robust coefficient standard errors. Under this heteroscedasticity robust model, we found that each variable remained significant at a < 0.1% significance level. Fourth, the Q-Q plot (Fig. 2a) indicated that the distribution of the residuals was reasonably close to the normal distribution with only very slightly heavier tails. Nevertheless, the Jarque–Bera test showed that our models violated the normal distribution assumption for the residuals. Last, plotting the leverage against the normalized residuals squares (Fig. 2b) indicated the presence of several outliers with large residuals or large leverage. In order to address the issues of non-normality of the residuals and presence of several outliers, we performed a robust regression with MM estimators. This robust regression yielded coefficients within one standard error of our original model, so we conclude that our original model is reasonably robust against the non-normality of residuals, heteroscedasticity and the presence of outliers. The prediction intervals of the heteroscedasticity robust model are depicted in Fig. 3. The code used to perform the analyses is provided as Additional files 2 and 3.

Fig. 2
figure 2

Robustness indicators of the proposed new prediction formula: QQ plot, displaying stringent linearity for theoretical (X-axis) over sample (Y-axis) quantiles (a) and leverage vs. normalized residuals squared plot, showing–as desired–significant clustering of values around the origin of the graph (b). Numbers in the figure are the tuple of the leverage of the data point and the normalized squared residual

Fig. 3:
figure 3

90% Prediction intervals for the heteroscedasticity robust model for the subgroups of male (a) and female (b) donors. The X-axis shows the product of the blood volume determined via the Nadler formula and the difference in Hb concentration before and after the bloodletting

Several blood or hemoglobin loss estimates have been proposed [16,17,18,19,20,21,22], their development footing on reasonable expectations, i.e. that blood volume would rapidly re-equilibrate, causing Hb dilution which would be reflective of Hb loss. The majority of these formulas were either not validated in individuals with volume-controlled bleeds, validated for one volume [17], or in a very small number of patients and targeting an Hct representing a very large blood loss, namely 30% [38]. We used our dataset to ascertain the quality of these alternative prediction algorithms, paying specific attention to the quality of the predictions at the extremes, i.e. small and large donors, small and large hemoglobin predicted loss, as well as linearity. Of note, our formula is not hypothesis-driven but generated using an unbiased “big data” approach where the use of any readily available observed values is permitted. The correlation coefficients of predicted and observed blood loss for the individual formulas are displayed in Table 2. Figure 4 features the plots of residual (actual versus predicted loss) against predicted loss. As is apparent, at the Hb dilution caused by a blood loss of 500 mL, all formulas calculate approximately the correct blood loss volume or the equivalent loss of Hb mass or RBC volume (residuals of approximately zero). Across all volumes, however, the fit of all the formulas is modest. In particular, depending on the model the average prediction is 10–30% below the actual loss. Moreover, all formulas systematically underestimate blood loss in low-prediction and overestimate blood loss in high-prediction intervals.

Table 2 Correlation coefficient and precision of prediction formulas
Fig. 4
figure 4

Residuals versus predicted values for the formulas of Mercuriali/ Lisander (same formula, so long as no blood is transfused; RBC loss in mL) (a), Gross, Bourne, Meunier and Ward (blood loss in mL) (b) and for the new formula presented here (Hb loss in g) (c). The significant slope of all formulas in a and b is apparent, indicating systematic underestimation of small and overestimation of large blood loss, as well as the mean of residuals for all formulas is markedly below zero, i.e. in the mean, blood loss is systematically underestimated by all these formulas. By contrast, the mean value of residuals and the gradient with the new formula (c) are zero, i.e. the predicted Hb loss is equally precise and accurate throughout the entire range of observations


The ability to determine intra-operative blood, RBC or hemoglobin loss is a highly meaningful measure for many quality management exercises, whether for comparison of surgical techniques [1,2,3,4,5,6,7,8,9,10,11,12], of different hospitals or of individual surgeons. For such exercises, linearity is the most important criterion, so that twice as high actual blood loss is reflected by twice as high a calculated value, importantly also at smaller blood volumes which do not change Hb much. “Linearity” is here used not to describe the mathematical transaction of observed values, but instead to indicate the correlation between observed and calculated blood loss. The existing formulas on average correctly calculate a blood loss of 500 mL as 500 mL, but a reduction of the blood loss by half (to 250 mL) is predicted as no blood loss at all. Using these formulas to calculate blood loss associated with a novel surgical technique will thus exaggerate the improvement brought on by the new technique. A formula which in addition to being linear is also accurate can serve to generate benchmarking data for certain surgical interventions, particularly if the precision is also quite good. For patients with reduced compensatory capacity, such data could then support decision making with respect to ordering of transfusion products and thus improve inventory management and reduce costs.

Surgeon- or anesthesist-estimated blood loss is highly imprecise, highly subjective and systematically much too low, i.e. of modest usefulness [13]. For many decades this problem has been recognized, and a number of more objective solutions have been proposed, namely simple mathematical formulas using pre- and post-surgical measures of hemoglobin or hematocrit [16,17,18,19,20,21,22]. Although the laboratory values needed for their application are readily available, none of the formulas have caught on, indicating their limited usefulness. And indeed as we are showing by comparing predicted and observed blood loss based on our controlled blood loss cohort, none of the formulas make predictions which are linear across a relevant range, let alone accurate or precise, as was expected based on their mathematical simplicity.

We therefore present a novel formula for hemoglobin loss calculation which was generated using modern mathematical tools. The formula is based on and was validated against a cohort of > 400 individuals subjected to controlled bloodletting across volumes equivalent to between 0.5 and 4.5 RBC units. The formula incorporates sex, height, weight and hemoglobin concentration in blood immediately before and 24–28 h after the controlled bloodletting. At a first glance rather surprisingly, of the different mathematical approaches applied, the fairly simple regression models performed much better than more advanced techniques such as neural networks, an observation we attribute to the limited size of the dataset. The precision of the novel formula is such that across the entire range of volumes–the equivalent of 29 to 233 g of hemoglobin–the hemoglobin loss was predicted with an average absolute error of 19.6 g (Table 2), the correlation between predicted and observed hemoglobin loss was strictly linear and the mean difference between predicted and observed hemoglobin loss for all donors was zero throughout the entire range of observed values. The accuracy of the determination is also not affected by donor body mass (not shown), likely due to the relatively modest effect of body mass on blood volume [39] compared to the modest precision of our formula. Whether blood loss in mL, RBC loss in mL, or hemoglobin loss in g is predicted is relatively immaterial due to the relatively tight correlation of the values and the ease with which they can be converted. Since the parameter primarily determining the need for “blood transfusion” treatment (together with the clinical picture) is hemoglobin, and since the remedy for severe anemia, packed red cells, comes in units of 50 g hemoglobin, we opted to generate a formula predicting hemoglobin loss. Pre-surgical Hb concentration can be used to translate hemoglobin loss to blood loss, multiplication of Hb loss with 2.87 roughly translates it to lost RBC volume except in patients with relevant micro- or macrocytic anemia.

While Hb loss from intra- and extravascular Hb stores during a bone marrow harvest can be very precisely ascertained, whether it appropriately models surgical blood loss is not self-evident. Several pieces of evidence suggest, however, that it is. First, where marrow (extravascular blood) is removed, it is replaced by blood. Second, if we look at Fig. 4b we see that the Meunier formula [17] which was developed and validated on the 500 mL blood loss of a blood donation had zero residuals at the 500 ml marrow aspiration volume. Thus for the one volume where data are available the effect of controlled bloodletting and marrow aspiration on Hb dilution is the same.

The price for the improved precision and linearity of the hemoglobin loss calculator is a formula too complex for mental arithmetic. To facilitate use of the formula in the clinic, we therefore generated a worksheet into which the relevant variables are entered and the predicted median and 90% range are calculated. The worksheet is included with the paper as Additional files 1, 2 and 3. For not Hb-relevant blood loss the formula will predict a range for Hb loss which includes zero. This is in consequence of the fact that significant blood loss can occur before Hb will begin to drop, but obviously unchanged Hb may also indicate the absence of a blood loss. Accordingly, for very minor blood loss as occurs during smaller surgical procedures like tonsillectomy [40] our tool is not going to be useful. The precision of the prediction being similar for all prediction algorithms is unsurprising since all are based on pre- and post-surgical Hb concentration.

Limitations of our study include that the formula will not be useful for estimation of minor, not Hb-relevant hemorrhage, and it was not validated for individuals with pre-existing anemia or not normovolemic patients, such as pregnant women. Minor Hb loss into connective tissue overlying the spina iliaca posterior superior from where bone marrow is aspirated will remain unaccounted for but is unlikely to relevantly affect the outcome of the analyses. Although the very refined study of Meunier et al.[17] teaches that normovolemia is not yet achieved within one day even after an incurred blood loss of only half a liter, a single blood draw after 24 h guided the development of our algorithm. It is possible that the timing of this blood draw is responsible for the modest precision of our algorithm. We did not test whether a blood draw immediately after the end of the procedure or at some much later time point could have improved prediction. We believe, however, that the morning-after blood draw is most easily integrated into clinical routines.

In summary, we have developed a calculator for estimation of surgical hemoglobin loss. Its strict linearity, high accuracy (mean error being zero) and reasonable precision predispose its use for intra- and inter-hospital benchmarking of surgical procedures and blood inventory management. Until the algorithm has been validated in an independent cohort it should not be used for medical decision-making.

Availability of data and materials

The datasets analysed during the current study are available from the corresponding author on reasonable request.



Application programming interface


Body mass index


Complete blood count


Explained variance






Inter-quartile range


Joint Accreditation Committee of the International Society of Cell Therapy ISCT and European Bone Marrow Transplantation Group EBMT


Mean squared error


Platelet, thrombocyte


Red blood cell, erythrocyte


Radial basis function


Rectified linear unit


World Marrow Donor Association


  1. Bentas W, Wolfram M, Jones J, et al. Robotic technology and the translation of open radical prostatectomy to laparoscopy: the early Frankfurt experience with robotic radical prostatectomy and one year follow-up. EurUrol. 2003;44:175–81.

    Google Scholar 

  2. Corrado G, Cutillo G, Pomati G, et al. Surgical and oncological outcome of robotic surgery compared to laparoscopic and abdominal surgery in the management of endometrial cancer. Eur J Surg Oncol. 2015;41:1074–81.

    Article  CAS  Google Scholar 

  3. Daylami R, Kargozaran H, Khatri VP. Liver resection using bipolar InLine multichannel radiofrequency device: impact on intra- and peri-operative outcomes. Eur J Surg Oncol. 2021;38:531–6.

    Article  Google Scholar 

  4. Djavan B, Agalliu I, Laze J, et al. Blood loss during radical prostatectomy: impact on clinical, oncological and functional outcomes and complication rates. BJU Int. 2012;110:69–75.

    Article  Google Scholar 

  5. Fagotti A, Gagliardi ML, Fanfani F, et al. Perioperative outcomes of total laparoendoscopic single-site hysterectomy versus total robotic hysterectomy in endometrial cancer patients: a multicentre study. Gynecol Oncol. 2012;125:552–5.

    Article  Google Scholar 

  6. Foss NB, Kehlet H. Hidden blood loss after surgery for hip fracture. J Bone Joint Surg Br. 2006;88:1053–9.

    Article  CAS  Google Scholar 

  7. Grochola LF, Soll C, Zehnder A, et al. Robot-assisted single-site compared with laparoscopic single-incision cholecystectomy for benign gallbladder disease: protocol for a randomized controlled trial. BMC Surg. 2017;17:13.

    Article  Google Scholar 

  8. Irisson E, Hemon Y, Pauly V, et al. Tranexamic acid reduces blood loss and financial cost in primary total hip and knee replacement surgery. OrthopTraumatolSurg Res. 2012;98:477–83.

    CAS  Google Scholar 

  9. Kunisaki C, Hatori S, Imada T, et al. Video-assisted thoracoscopic esophagectomy with a voice-controlled robot: the AESOP system. SurgLaparoscEndoscPercutan Tech. 2004;14:323–7.

    Google Scholar 

  10. Luo G, Fan H, Sun W, et al. Blood loss during extensive escharectomy and auto-microskin grafting in adult male major burn patients. Burns. 2011;37:790–3.

    Article  Google Scholar 

  11. Radomski M, Zenati M, Novak S, et al. Factors associated with prolonged hospitalization in patients undergoing pancreatoduodenectomy. Am J Surg. 2018;215:636–42.

    Article  Google Scholar 

  12. Soliman PT, Frumovitz M, Sun CC, et al. Radical hysterectomy: a comparison of surgical approaches after adoption of robotic surgery in gynecologic oncology. Gynecol Oncol. 2011;123:333–6.

    Article  Google Scholar 

  13. Budair B, Ahmed U, Hodson J, et al. Are we all guilty of under-estimating intra-operative blood loss during hip fracture surgery? J Orthop. 2017;14:81–4.

    Article  Google Scholar 

  14. Goodnough LT, Panigrahi AK. Estimating Blood Loss. AnesthAnalg. 2017;125:13–4.

    Google Scholar 

  15. Fábián A, Popovics M. EinfachesVerfahren zur Messung der operativenBlutverluste [A simple technique in measuring surgical blood loss]. Munch Med Wochenschr. 1954;96:1168–9.

    PubMed  Google Scholar 

  16. Moore RC Jr, Canizaro PC, Sawyer RB, et al. An evaluation of methods for measuring operative blood loss. AnesthAnalg. 1965;44:130–4.

    Google Scholar 

  17. Meunier A, Petersson A, Good L, et al. Validation of a haemoglobin dilution method for estimation of blood loss. Vox Sang. 2008;95:120–4.

    Article  CAS  Google Scholar 

  18. Mercuriali F, Inghilleri G. Proposal of an algorithm to help the choice of the best transfusion strategy. Curr Med Res Opin. 1996;13:465–78.

    Article  CAS  Google Scholar 

  19. Lisander B, Ivarsson I, Jacobsson SA. Intraoperative autotransfusion is associated with modest reduction of allogeneic transfusion in prosthetic hip surgery. Acta AnaesthesiolScand. 1998;42:707–12.

    Article  CAS  Google Scholar 

  20. Gross JB. Estimating allowable blood loss: corrected for dilution. Anesthesiology. 1983;58:277–80.

    Article  CAS  Google Scholar 

  21. Brecher ME, Monk T, Goodnough LT. A standardized method for calculating blood loss. Transfusion. 1997;37:1070–4.

    Article  CAS  Google Scholar 

  22. Bourke DL, Smith TC. Estimating allowable hemodilution. Anesthesiology. 1974;41:609–12.

    Article  CAS  Google Scholar 

  23. Lopez-Picado A, Albinarrate A, Barrachina B. Determination of perioperative blood loss: accuracy or approximation? AnesthAnalg. 2017;125:280–6.

    CAS  Google Scholar 

  24. Brauninger S, Thorausch K, Luxembourg B, et al. Deferrals of volunteer stem cell donors referred for evaluation for matched-unrelated stem cell donation. Bone Marrow Transplant. 2014;49:1419–25.

    Article  CAS  Google Scholar 

  25. Kim-Wanner SZ, Luxembourg B, Schmidt AH, et al. Introduction of principles of blood management to healthy donor bone marrow harvesting. Vox Sang. 2020;115:802–12.

    Article  Google Scholar 

  26. Galton F. Regression towards mediocrity in hereditary stature. J AnthropolInst G B Irel. 1986;15:246–63.

    Google Scholar 

  27. Gruber MHJ (1998) Improving efficiency by shrinkage: The James-Stein and ridge regression estimators. CRC Press.

  28. Chen T, Guestrin C (2016) XGBoost: A scalable tree boosting system. KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 785–794.

  29. D'Ambrosio A, Tutore VA (2011) Conditional classification trees by weighting the gini impurity measure. New Perspectives in Statistical Modeling and Data Analysis. Berlin Heidelberg: Springer; 273–280.

  30. Ho TK (1995) Random decision forests. Proceedings of 3rd International Conference on Document Analysis and Recognition 1995; 1:278–282.

  31. Vapnik V (2013) The Nature of Statistical Learning Theory. Springer Science & Business Media.

  32. Wang J, Chen Q, Chen Y (2004) Rbf kernel based support vector machine with universal approximation and its application. Advances in Neural Networks – ISNN 2004. Berlin Heidelberg: Springer; 512–517.

  33. Cover T. Hart P (1967) Nearest neighbor pattern classification. IEEE Trans Inf Theory. 1967;13:21–7.

    Article  Google Scholar 

  34. Kingma D, Ba J (2015) Adam: A method for stochastic optimization. International Conference on Learning Representations 2015.

  35. Buitinck L, Louppe G, londel M, , et al. API design for machine learning software: experiences from the scikit-learn project. ECMLPKDD Workshop: Languages for Data Mining and Machine Learning; 2013. p. 108–22.

    Google Scholar 

  36. Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: machine learning in python. J Machine Learning Res. 2011;12:2825–30.

    Google Scholar 

  37. Nadler SB, Hidalgo JH, Bloch T. Prediction of blood volume in normal human adults. Surgery. 1962;51:224–32.

    PubMed  Google Scholar 

  38. Ward CF, Meathe EA, Benumof JLTF. A computer nomogram for blood loss replacement. Anesthesiology. 1980;53:S126.

    Article  Google Scholar 

  39. Lemmens HJ, Bernstein DP, Brodsky JB. Estimating blood volume in obese and morbidly obese patients. ObesSurg. 2006;16:773–6.

    Google Scholar 

  40. Baer GA, Rorarius MG, Kolehmainen S, et al. The effect of paracetamol or diclofenac administered before operation on postoperative pain and behaviour after adenoidectomy in small children. Anaesthesia. 1992;47:1078–80.

    Article  CAS  Google Scholar 

Download references


Not applicable.


Open Access funding enabled and organized by Projekt DEAL.. The work has not received external funding.

Author information

Authors and Affiliations



PL and MHK performed analyses and validations. SZKW, ES and HB conceived of the idea for the study and generated the data. PL, MHK and HB co-wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Halvard Bonig.

Ethics declarations

Ethics approval and consent to participate

This pseudonymized retrospective analysis of routine clinical data is part of our JACIE mandated ongoing product quality review activity. The Ethics Committee of Goethe University Medical Center has confirmed that no informed consent or ethics review is required for these activities.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Excel worksheet for volume loss calculation. Values are pasted into the respective fields, ENTER calculates the incurred blood loss (mean, range).

Additional file 2.

Model Selection

Additional file 3.

Linear Regression

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hahn-Klimroth, M., Loick, P., Kim-Wanner, SZ. et al. Generation and validation of a formula to calculate hemoglobin loss on a cohort of healthy adults subjected to controlled blood loss. J Transl Med 19, 116 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: